Open data sets available for reuse


Fulltext

Our fulltext data sets contain both the complete text (often digitised and OCRd) as well as metadata about documents.

  • BooksAll copyright free books that the National Library has digitised from its collections.
  • Classics LibraryA collection of classic Finnish fiction from 19th and 20th centuries.
  • Collection CataloguesDigitized catalogues and card files of the National Library collections. Collections are not fully catalogued in the library databases, hence the old card files and catalogues can provide supplemental information on the collections.
  • Digitalia data packages
  • Dissertations of the Royal Academy of TurkuThis collection contains 4173 digitized dissertations that were defended at the Royal Academy of Turku between 1642 and 1828. The collection also includes a number of Pehr Kalm's dissertations.
  • Ephemera CollectionA digitised collection of ephemera from the legal deposit collections of the National Library of Finland. Subject matters include tourism, protection of animals, war-time rationing, women's movement, etiquette, sports, board games and vehicles. Publication dates range from early 19th century to 1944.
  • Fenno-UgricaFenno-Ugrica is a digital collection of publications in Uralic languages. The Fenno-Ugrica collection includes more than 1500 monographs and over 110 newspaper and journal titles in 20 languages. The collection also features word lists, which are generated from the digitized and edited books by language. Zip-files with full-text and images are included with some of the titles.
  • Finnish Civil War And IndependenceA selection of ephemera from the events of 1917 and 1918 in the midst of Finnish civil war. The collection offers documents on the Red Guards, the White Guard, inserts for the newspapers, declarations and food supply.
  • Finnish journals -1929Digitised collection of generic journals in Finland until 1920.
  • Finnish newspapers' layout analysis (METS package) 1771-1917The layout analysis files from digitisation for Finnish newspapers, years 1771-1917.
  • Finnish newspapers 1771-1929Digitised collection of newspapers published in Finland from the 18th century up until 1929.
  • Fragmenta Membranea CollectionThe Fragmenta membranea collection contains the vast majority of the remains of books written and used in the eastern parts of medieval Sweden, the Diocese of Turku. The Fragmenta membranea database contains 9,319 digitized parchment leaves meaning 18,638 pages which come from approximately 1,500 different medieval manuscripts.
  • History of the booksA broad collection of books and other texts from the 18th and 19th centuries ranging from devotional books and broadside to educational material and fiction. There are also catalogues from book actions.
  • Illustration base type classifier model fileIllustration base type classifier model file for newspaper, journal etc. illustration categorization.
  • Manuscript collectionDigitised material from the Manuscript Collection. The material includes Medieval and sixteenth-century manuscript books, Mannerheim's Fragment Collection, Paul Scheel's letter collection, parchment Letters and Väinö Raitio’s musical manuscripts. Also the main card index of the Manuscipt Collection is available.
  • Maps and Atlases of FinlandA collection of digitized maps about Finland ranging from the 16th century to 20th century. Map types include Town maps, general maps of provinces and regions, nautical charts, town and parish maps, and Atlases.
  • Nordenskiöld Map Collection, TheA selection of digitized maps  from the Nordenskiöld Collection. The maps depict the development of Western countries' geographical knowledge. They cover all continents, with a particular emphasis on Arctic areas. There is an almost complete series of the Geographica, the classic cartographic work by Claudius Ptolemy, as well as a considerable number of works related to the discovery of America.
  • OCR Ground Truth Package for Finnish FrakturPackage contains 450 page images and ALTO XML files for each page, with the proofreading done by the Finnish native speakers.
  • OCR Ground Truth Package for Swedish FrakturPackage contains page images and ALTO XML files for each page, with the proofreading done in Swedish  by the Finnish native speakers.
  • Raita: Early Finnish RecordingsRaita is a collection of digitized early Finnish sound recordings.
  • Technical Ephemera CollectionDigitised collection of technical ephemera (selection of brochures, ads, leaflets, price catalogues and instruction guides)
  • Tesseract3 Finnish fraktur modelTesseract 3 Finnish fraktur model
  • Uusi Suometar (1457-4721) ALTO XMLALTO XML files of newspaper Uusi Suometar (1457-4721) years 1869-1917.
  • Uusi Suometar (1457-4721) REOCR ALTO XMLThe REOCR'd ALTO XML files of newspaper Uusi Suometar (1457-4721) years 1869-1917.

Metadata

Our metadata repositories contain records that describe documents, but the actual documents are stored elsewhere. For electronic or scanned documents, we provide a link to the document content.

  • ArtoARTO is the reference database and metadata reserve of Finnish periodical and monograph articles.
  • FennicaFennica - the Finnish National Bibliography is a database dedicated to Finnish publication activities.
  • Fennica-LDThis is the Linked Data version of Fennica - the Finnish National Bibliography. 
  • Finna.fiFinna is a search service that aggregates metadata from Finnish archives, libraries and museums.
  • MelindaMelinda is the National Metadata Repository (union catalog).
  • ViolaViola is the Finnish national discography and the national bibliography of sheet music.

Vocabularies

Controlled vocabularies such as thesauri, ontologies and authority files are referenced from metadata descriptions. We provide access to controlled vocabularies mainly through the Finto http://finto.fi service.

  • Allärs - Allmän tesaurus på svenskaAllärs is the Swedish translation of YSA - General Finnish Thesaurus.
  • FGF - Finnish genre and form vocabularyFGF (in Finnish known as SLM) is a bilingual (Finnish + Swedish) vocabulary for describing genres and forms of literature (both fiction and non-fiction) and music.
  • Finnish Corporate NamesThe Finnish corporate names data set is used by the National Library of Finland in the description of the national bibliography Fennica.
  • ISIL Identifiers of Finnish LibrariesAn ISIL identifies a library, an archive, a museum or a related organization, or one of its subordinate units.
  • KOKO OntologyKOKO is a collection of Finnish core ontologies, which have been merged together.
  • Metadata thesaurusMetadata thesaurus contains terms and expressions required in describing materials. The thesaurus is also suitable for selecting headings displayed in user interfaces.
  • PLC - Finnish Public Libraries Classification SystemThe classification system for Finnish public libraries PLC (known by its Finnish acronym YKL) is an adaptation of the Dewey Decimal Classification.
  • SEKO - Finnish Medium of Performance ThesaurusSEKO is a thesaurus in Finnish which covers instruments, voices etc. used in the performance of musical works.
  • UDC SummaryThe Multilingual Universal Decimal Classification Summary.
  • YSA - General Finnish ThesaurusYSA is a general thesaurus in Finnish which covers all fields of research and knowledge, which contains the most common terms and geographical names used in content description.
  • YSE - concept suggestions for YSOYSE is a collection of concepts that have been suggested for inclusion in YSO, but they have not yet been accepted.
  • YSO - General Finnish OntologyGeneral Finnish Ontology YSO is a trilingual ontology consisting mainly of general concepts.
  • YSO placesYSO places is a multilingual gazetteer.


  • No labels