Our fulltext data sets contain both the complete text (often digitised and OCRd) as well as metadata about documents. 

Books

Description

All copyright free books that the National Library has digitised from its collections.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_109240)
Doria OpenSearch
(use parameter scope=10024/109240, e.g. search laplanders )
Individual documents may be downloaded from Doria.

License
CC0 for most titles, with few exceptions as CC-BY

Classics Library

Description

A collection of classic Finnish fiction from 19th and 20th centuries.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=col_10024_88083)
Doria OpenSearch
(use parameter scope=10024/88083, e.g. search rakkaus)
Individual documents may be downloaded from Doria.

License
CC0

Collection Catalogues

Description

Digitized catalogues and card files of the National Library collections. Collections are not fully catalogued in the library databases, hence the old card files and catalogues can provide supplemental information on the collections .

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_111861)
Doria OpenSearch
(use parameter scope=10024/111861, e.g. search machine)
Individual documents may be downloaded from Doria.

License
CC0

Dissertations of the Royal Academy of Turku

Description

This collection contains 4173 digitized dissertations that were defended at the Royal Academy of Turku between 1642 and 1828. The collection also includes a number of Pehr Kalm's dissertations.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=col_10024_50699)
Doria OpenSearch
(use parameter scope=10024/50699, e.g. search aquae)
Individual documents may be downloaded from Doria.

License
CC0

Ephemera Collection

Description

A digitised collection of ephemera from the legal deposit collections of the National Library of Finland. Subject matters include tourism, protection of animals, war-time rationing, women's movement, etiquette, sports, board games and vehicles. Publication dates range from early 19th century to 1944.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_85119 except for the board games use set=col_10024_121989)
Doria OpenSearch
(use parameter scope=10024/85119 or 10024/121989 for the board games, eg.g. search chrysler)
Individual documents may be downloaded from Doria.

License
CC0

Fenno-Ugrica

Description

Fenno-Ugrica is a digital collection of publications in Uralic languages. The Fenno-Ugrica collection includes more than 1500 monographs and over 110 newspaper and journal titles in 20 languages. The collection also features word lists, which are generated from the digitized and edited books by language. Zip-files with full-text and images are included with some of the titles.

User interface
Fenno-Ugrica
APIs

Fenno-Ugrica OAI-PMH, and direct link to the OAI-interface 
 OpenSearch, e.g. search анатомия  on the books collectionIndividual documents may be downloaded from Fenno-Ugrica.

License
Public domain based on due diligence agreement, Certificate is available in http://s1.doria.fi/ohje/img-603112949-0001.pdf

Finnish Civil War And Independence

Description

A selection of ephemera from the events of 1917 and 1918 in the midst of Finnish civil war. The collection offers documents on the Red Guards, the White Guard, inserts for the newspapers, declarations and food supply.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_111871)
Doria OpenSearch
(use parameter scope=10024/111871, e.g. search mannerheim)
Individual documents may be downloaded from Doria.

License
CC0

Finnish journals -1929

Description

Digitised collection of generic journals in Finland until 1920.

Detailed description (in Finnish)

Note that 1918-1929 is opened by agreement with Kopiosto and National Library for year 2018. (Newsletter)

User interface
Journals at digi.kansalliskirjasto.fi
Data downloads

Zip packages, which custom XML contains metadata, ALTO XML, and raw text of a page.

Data package contains journal material until 1910.

APIs
Digi OAI-PMH
License
Terms of use (in Finnish).

Finnish newspapers' layout analysis (METS package) 1771-1917

Description

The layout analysis files from digitisation for Finnish newspapers, years 1771-1917.

Zip packages, which contain METS XML for each binding. METS xml standard contains layout information of the materials and technical processing information.

Note! Due to improvements in materials, the few years back created ALTO XML export packages are not fully in sync with the METS information. I.e. some binding id's that exist in ALTO exports can be missing from METS, which have been generated in early September 2018.

 

User interface
Newspapers at digi.kansalliskirjasto.fi
Data downloads
https://digi.kansalliskirjasto.fi/opendata/submit  Pick (Other)
APIs
 
License
Terms of use (in Finnish).

Finnish newspapers 1771-1929

Description

Digitised collection of newspapers published in Finland from the 18th century up until 1929.

Note that materials of 1918-1929 is opened by agreement with Kopiosto and National Library for year 2018. (Newsletter)

User interface
Newspapers at digi.kansalliskirjasto.fi
Data downloads
Zip packages, which custom XML contains metadata, ALTO XML, and raw text of a page. Data packages contain material of newspapers until end of 1917 and journals until end of 1910.
APIs

Digi OAI-PMH

Digi OpenURL

License
Terms of use (in Finnish).

Fragmenta Membranea Collection

Description

The Fragmenta membranea collection contains the vast majority of the remains of books written and used in the eastern parts of medieval Sweden, the Diocese of Turku. The Fragmenta membranea database contains 9,319 digitized parchment leaves meaning 18,638 pages which come from approximately 1,500 different medieval manuscripts.

User interface
Fragmenta membranea
APIs

Fragmenta OAI-PMH
OpenSearch (e.g. search Gloria)
Individual documents may be downloaded from Fragmenta Membranea.

License
CC0

History of the books

Description

A broad collection of books and other texts from the 18th and 19th centuries ranging from devotional books and broadside to educational material and fiction. There are also catalogues from book actions.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_144026)
Doria OpenSearch
(use parameter scope=10024/144026, e.g. search rakkaus)
Individual documents may be downloaded from Doria.

License
CC0

Manuscript collection

Description

Digitised material from the Manuscript Collection. The material includes Medieval and sixteenth-century manuscript books, Mannerheim's Fragment Collection, Paul Scheel's letter collection, parchment Letters and Väinö Raitio’s musical manuscripts. Also the main card index of the Manuscipt Collection is available.


User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_109242)
Doria OpenSearch
(use parameter scope=10024/109242, e.g. search Mannerheim)
Individual documents may be downloaded from Doria.

License
CC0

Maps and Atlases of Finland

Description

A collection of digitized maps about Finland ranging from the 16th century to 20th century. Map types include Town maps, general maps of provinces and regions, nautical charts, town and parish maps, and Atlases.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_78800)
Doria OpenSearch
(use parameter scope=10024/78800, e.g. search Turku)
Individual documents may be downloaded from Doria.

License
CC0

Nordenskiöld Map Collection, The

Description

A selection of digitized maps  from the Nordenskiöld Collection. The maps depict the development of Western countries' geographical knowledge. They cover all continents, with a particular emphasis on Arctic areas. There is an almost complete series of the Geographica, the classic cartographic work by Claudius Ptolemy, as well as a considerable number of works related to the discovery of America.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_97216)
Doria OpenSearch
(use parameter scope=10024/97216, e.g. search Belgia)
Individual documents may be downloaded from Doria.

License
CC0

OCR Ground Truth Package for Finnish Fraktur

Description

OCR Ground truth package for Finnish Fraktur.

Package contains 450 page images and ALTO XML files for each page, with the proofreading done by the Finnish native speakers. The pages of fraktur range from the year 1836 until 1910. The package can help in creating own postcorrection algorithms for OCR text recognition.

There is also an Excel file for all of the 471 903 words, which contains result given to the word by Tesseract and FineReader. If a tool hasn't found corresponding word, then the given cell is empty, so select the words in the Excel, which you need.


NB! The ground truth package does not contain the data for the 1918 due to copyright reasons.

User interface
 
Data downloads
http://digi.kansalliskirjasto.fi/opendata  
API
-
License
Terms of use (in Finnish).

Raita: Early Finnish Recordings

Description
Raita is a collection of digitized early Finnish sound recordings.
User interface
Doria
APIs

Doria OAI-PMH (use parameter set=col_10024_66373)
Doria OpenSearch
(use parameter scope=10024/66373, e.g. search Verdi)
Individual documents may be downloaded from Doria.

License
CC0

Technical Ephemera Collection

Description

Digitised collection of technical ephemera (selection of brochures, ads, leaflets, price catalogues and instruction guides)

Detailed description (in Finnish)

User interface
Ephemera at digi.kansalliskirjasto.fi
Data downloads
-
API
-
License
 

 

  • No labels