You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 5 Next »

Fulltext

Books

Copyright free books that the National Library has digitised from its collections.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_109240)
Doria OpenSearch
(use parameter scope=10024/109240, e.g. search laplanders )
Individual documents may be downloaded from Doria.

License
CC0 for most titles, with few exceptions as CC-BY

Classics Library

A collection of classic Finnish fiction from 19th and 20th centuries.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=col_10024_88083)
Doria OpenSearch
(use parameter scope=10024/88083, e.g. search rakkaus)
Individual documents may be downloaded from Doria.

License
CC0

Collection Catalogues

Digitized catalogues and card files of the National Library collections. Collections are not fully catalogued in the library databases, hence the old card files and catalogues can provide supplemental information on the collections.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_111861)
Doria OpenSearch
(use parameter scope=10024/111861, e.g. search machine)
Individual documents may be downloaded from Doria.

License
CC0

Digi collection texts and metadata

Metadata of digitized collections texts and metadata

Description

Metadata of digitized collections texts and metadata

User interface
digi.kansalliskirjasto.fi → Collections
Data downloads
Digi.kansalliskirjasto.fi/opendata -page

Select file:  Digi collection texts and metadata [v1](106.2 kB)
APIs


License
Terms of use

Digitalia data packages

Digitalia (2017-2019)

Uusi Suometar (1457-4721) REOCR ALTO XML

Uusi Suometar (1457-4721) ALTO XML

Dissertations of the Royal Academy of Turku

This collection contains 4173 digitized dissertations that were defended at the Royal Academy of Turku between 1642 and 1828. The collection also includes a number of Pehr Kalm's dissertations.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=col_10024_50699)
Doria OpenSearch
(use parameter scope=10024/50699, e.g. search aquae)
Individual documents may be downloaded from Doria.

License
CC0

Ephemera Collection

A digitised collection of ephemera from the legal deposit collections of the National Library of Finland. Subject matters include tourism, protection of animals, war-time rationing, women's movement, etiquette, sports, board games and vehicles. Publication dates range from early 19th century to 1944.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_85119 except for the board games use set=col_10024_121989)
Doria OpenSearch
(use parameter scope=10024/85119 or 10024/121989 for the board games, eg.g. search chrysler)
Individual documents may be downloaded from Doria.

License
CC0

Fenno-Ugrica

Fenno-Ugrica is a digital collection of publications in Uralic languages. The Fenno-Ugrica collection includes more than 1500 monographs and over 110 newspaper and journal titles in 20 languages. The collection also features word lists, which are generated from the digitized and edited books by language. Zip-files with full-text and images are included with some of the titles.

User interface
Fenno-Ugrica
APIs

Fenno-Ugrica OAI-PMH, and direct link to the OAI-interface 
 OpenSearch, e.g. search анатомия  on the books collectionIndividual documents may be downloaded from Fenno-Ugrica.

License
Public domain based on due diligence agreement, Certificate is available in http://s1.doria.fi/ohje/img-603112949-0001.pdf

Fin-Clariah dataset - Copyright-free Finnish newspapers and periodicals

Digitised collection of copyright-free newspapers and periodicals published in Finland. This dataset is available via Allas-service in CSC via Fin-clariah project.

Description

Digitised collection of copyright-free newspapers published in Finland. This dataset is available via Allas-service in CSC via Fin-clariah project.   See detailed instructions here.


Dataset id links from Fin-Clariah dataset to metadata records can be found below.

User interface
Newspapers at digi.kansalliskirjasto.fi
Data downloads
  • Newspapers until 31.12.1918
  • Journals until  31.12.1912
  • Copyright free books 
APIs

Digi OAI-PMH

https://digi.kansalliskirjasto.fi/interfaces/OAI-PMH?metadataPrefix=oai_dc&set=col-861&verb=ListIdentifiers

License
Terms of use

Finnish Civil War And Independence

A selection of ephemera from the events of 1917 and 1918 in the midst of Finnish civil war. The collection offers documents on the Red Guards, the White Guard, inserts for the newspapers, declarations and food supply.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_111871)
Doria OpenSearch
(use parameter scope=10024/111871, e.g. search mannerheim)
Individual documents may be downloaded from Doria.

License
CC0

Finnish journals -1939

Digitised collection of generic journals in Finland until end of 1939.

Description

Detailed description (in Finnish)

Note that 1921-1939 is opened by agreement with Kopiosto and National Library for year 2023.

User interface
Journals at digi.kansalliskirjasto.fi
Data downloads

Zip packages, which custom XML contains metadata, ALTO XML, and raw text of a page.

Data package contains journal material until 1910.

APIs
Digi OAI-PMH
License
Terms of use

Finnish newspapers' layout analysis (METS package) 1771-1917

The layout analysis files from digitisation for Finnish newspapers, years 1771-1917.

Description

Zip packages, which contain METS XML for each binding. METS xml standard contains layout information of the materials and technical processing information.

Note! Due to improvements in materials, the few years back created ALTO XML export packages are not fully in sync with the METS information. I.e. some binding id's that exist in ALTO exports can be missing from METS, which have been generated in early September 2018.

User interface
Newspapers at digi.kansalliskirjasto.fi
Data downloads
https://digi.kansalliskirjasto.fi/opendata/submit  Pick (Other)
APIs

License
Terms of use (in Finnish).

Finnish newspapers 1771-1939

Digitised collection of newspapers published in Finland from the 18th century up until 1939.

Description

Note that materials of 1918-1939 is opened by agreement with Kopiosto and National Library for year 2023

User interface
Newspapers at digi.kansalliskirjasto.fi
Data downloads
Zip packages, which custom XML contains metadata, ALTO XML, and raw text of a page. Data packages contain material of newspapers until end of 1917 and journals until end of 1910.
APIs

Digi OAI-PMH

Digi OpenURL

License
Terms of use

Fragmenta Membranea Collection

The Fragmenta membranea collection contains the vast majority of the remains of books written and used in the eastern parts of medieval Sweden, the Diocese of Turku. The Fragmenta membranea database contains 9,319 digitized parchment leaves meaning 18,638 pages which come from approximately 1,500 different medieval manuscripts.

User interface
Fragmenta membranea
APIs

Fragmenta OAI-PMH
OpenSearch (e.g. search Gloria)
Individual documents may be downloaded from Fragmenta Membranea.

License
CC0

History of the books

A broad collection of books and other texts from the 18th and 19th centuries ranging from devotional books and broadside to educational material and fiction. There are also catalogues from book actions.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_144026)
Doria OpenSearch
(use parameter scope=10024/144026, e.g. search rakkaus)
Individual documents may be downloaded from Doria.

License
CC0

Illustration base type classifier model file

Illustration base type classifier model file for newspaper, journal etc. illustration categorization.

User interface
  • nlf_basetype_classifier.pb
  • nlf_basetype_classifier_labels.txt

 

Some examples of concept here: https://blogs.helsinki.fi/digitalia/?s=tensorflow&submit=Search

Classifier model file can be used with TensorFlow (https://www.tensorflow.org/guide/saved_model )


When using the file, please cite:

https://digi.nationallibrary.fi , Digital Collections of National Library of Finland, Illustration classifier model file of Digitalia, 30.9.2019.

Data downloads
http://digi.kansalliskirjasto.fi/opendata  
API
-
License
Terms of use

Manuscript collection

Digitised material from the Manuscript Collection. Versatile material includes Medieval and sixteenth-century manuscript books, Mannerheim's Fragment Collection, Paul Scheel's letter collection, parchment Letters, J.J. Tikkanen's sketch books and Väinö Raitio’s musical manuscripts. Also the main card index of the Manuscipt Collection is available.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_109242)
Doria OpenSearch
(use parameter scope=10024/109242, e.g. search Mannerheim)
Individual documents may be downloaded from Doria.

License
CC0

Maps and Atlases of Finland

A collection of digitized maps about Finland ranging from the 16th century to 20th century. Map types include Town maps, general maps of provinces and regions, nautical charts, town and parish maps, and Atlases.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_78800)
Doria OpenSearch
(use parameter scope=10024/78800, e.g. search Turku)
Individual documents may be downloaded from Doria.

License
CC0

Nordenskiöld Map Collection, The

A selection of digitized maps  from the Nordenskiöld Collection. The maps depict the development of Western countries' geographical knowledge. They cover all continents, with a particular emphasis on Arctic areas. There is an almost complete series of the Geographica, the classic cartographic work by Claudius Ptolemy, as well as a considerable number of works related to the discovery of America.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=com_10024_97216)
Doria OpenSearch
(use parameter scope=10024/97216, e.g. search Belgia)
Individual documents may be downloaded from Doria.

License
CC0

OCR Ground Truth Package for Finnish Fraktur

Package contains 450 page images and ALTO XML files for each page, with the proofreading done by the Finnish native speakers.

Description

The pages of fraktur range from the year 1836 until 1910. The package can help in creating own postcorrection algorithms for OCR text recognition.

There is also an Excel file for all of the 471 903 words, which contains result given to the word by Tesseract and FineReader. If a tool hasn't found corresponding word, then the given cell is empty, so select the words in the Excel, which you need.

NB! The ground truth package does not contain the data for the 1918 due to copyright reasons.

User interface

Data downloads
http://digi.kansalliskirjasto.fi/opendata  
API
-
License
Terms of use (in Finnish).

OCR Ground Truth Package for Swedish Fraktur

Package contains page images and ALTO XML files for each page, with the proofreading done in Swedish  by the Finnish native speakers.

Description

The pages of fraktur range from the year 1771 until 1915. The package can help in creating own post correction algorithms for OCR text recognition.

Note1. the tiff files exif metadata lacks resolution information, so if the coordinates of ALTO do not match, be aware that images has been done either 200 or 300 dpi.

Note2. The ground truth package does not contain the data of the 1918 or later due to copyright reasons.

User interface

Data downloads
http://digi.kansalliskirjasto.fi/opendata  
API
-
License
Terms of use

Raita: Early Finnish Recordings

Raita is a collection of digitized early Finnish sound recordings.

User interface
Doria
APIs

Doria OAI-PMH (use parameter set=col_10024_66373)
Doria OpenSearch
(use parameter scope=10024/66373, e.g. search Verdi)
Individual documents may be downloaded from Doria.

License
CC0

Technical Ephemera Collection

Digitised collection of technical ephemera (selection of brochures, ads, leaflets, price catalogues and instruction guides)

Description

Detailed description (in Finnish)

User interface
Ephemera at digi.kansalliskirjasto.fi
Data downloads
-
API
-
License

Tesseract3 Finnish fraktur model

Tesseract 3 Finnish fraktur model

User interface

Copy the file to the Tesseract’s TESSDATA directory.

You can utilize the file in Tesseract via:

tesseract input.jpg out -l fi_frak_nlf 


See also tesseract 3 in Github:  https://github.com/tesseract-ocr/tesseract

Data downloads
http://digi.kansalliskirjasto.fi/opendata  
API
-
License
Terms of use

Translocalis clippings 1820-1885

Translocalis clippings 1820-1885

Description

Translocalis is a digital database for reader letters written in different locations and published in Finnish papers up to the year 1885. The Translocalis database contains 72 000 reader letters from Finland and abroad. In the name of the collection, trans refers to an object going over or through something and localis refers to a space or location. Combined, Translocalis expresses something more than local, which these local letters represented. The Finnish-speaking press started out on a nation-wide level and became more regional and local only during the latter half of the 19th century.

User interface
https://digi.kansalliskirjasto.fi/collections?id=742
Data downloads

Zip package, which contains

  • individual text file for each clipping text 
  • Log of downloaded clippings
  • Clippings metadata excels 1820-1883 and 1884-1885


Folder Structure:

translocalis_data/year/ISSN/txt/388373_2973653_1457-4403_1868-04-09_15_page-2

  • File name is formulated by article_id, binding_id, issn, publishing date (YYYY-MM-DD), issue, and page on which clipping has been taken.


Clippings metadata (2 files)

(part until 1883):  translocalis_clippings_export_1820_1883.xlsx 

(1884-1885):  translocalis_clippings_export_1884_1885.xlsx 


Fields in metadata  excels:

Main title  - name of the newspaper

ISSN  - (International Standard Serial Number), i.e. newpaper identifier

Date - Publishing date of thee number

Issue - Issue number of newspaper (can be empty, or contain A, Bs e.g. for extra editions)

URL - link to the clipping page

Title - Title eof the article

Keywords - keywords of clipping  (see Wiki for full explanations of translocalis_xyz fields)

Category - category of the article

Subject 

Notes - if any extra notes

Created  - when clipping was created

OCR - text contents of the clipping



APIs

Digi OAI-PMH

Digi OpenURL

License
Terms of use

Uusi Suometar (1457-4721) ALTO XML

ALTO XML files of newspaper Uusi Suometar (1457-4721) years 1869-1917.

Description

ALTO XMLs as they have been produced in the digitisation.

Year 1918 excluded.

User interface
https://digi.kansalliskirjasto.fi
Data downloads
https://digi.kansalliskirjasto.fi/opendata  
API
-
License
Terms of use

Uusi Suometar (1457-4721) REOCR ALTO XML

The REOCR'd ALTO XML files of newspaper Uusi Suometar (1457-4721) years 1869-1917.

Description

ALTO XMLs as they have been produced in the digitisation.

Year 1918 excluded. 

User interface
https://digi.kansalliskirjasto.fi
Data downloads
https://digi.kansalliskirjasto.fi/opendata  
API
-
License
Terms of use

Metadata

Arto

ARTO is an aggregation of metadata on Finnish periodical and monograph articles. Formerly a discrete database, it is now part of Melinda. 

User interface
Arto in the National Library search service
Data downloads
MARC records
APIs

Z39.50 and SRU

OAI-PMH

License
CC0

Fennica

Fennica - the Finnish National Bibliography is a database dedicated to Finnish publication activities.

Description

The database in based on the National Colllection materials provided pursuant to provisions in the Act on Collecting and Preserving Cultural Materials and it complies with international recommendations on national bibliographies.

Detailed description in English: Repository description: Fennica , and in Finnish: Tietovarantokuvaus: Fennica

User interface
Fennica in the search portal of the National Library
Data downloads
MARC records
APIs

Z39.50 and SRU

OAI-PMH

License
CC0

Fennica-LD

This is the Linked Data version of Fennica - the Finnish National Bibliography. 

Description

The original data from Fennica as well as auxiliary data sets including the subject authorities (YSA and YSO) and the corporate name authority has been combined into an RDF representation.

User interface
Fennica Linked Data browser
Data downloads
RDF Linked Data (N-Triples and HDT formats)
APIs
Linked Data
License
CC0

Finna.fi

Finna is a search service that aggregates metadata from Finnish archives, libraries and museums.

Description
The Finna.fi main index contains metadata from over 100 institutions, including the main bibliographic databases.
User interface

Finna.fi

Data downloads
-
APIs
Finna REST API 
License
CC0

Melinda

Melinda is the National Metadata Repository (union catalog).

User interface
Melinda OPAC
Data downloads
MARC records
APIs

Z39.50 and SRU

OAI-PMH

License
CC0

Viola

Viola is the Finnish national discography and the national bibliography of sheet music.

Description
Viola contains references on Finnish sound recordings since 1901 and Finnish sheet music since 1977. It contains information about musical publications as well as single works and pieces therein. Formerly a discrete database, it is now accessible as part of other cataloguing information of the National Library.
User interface
Viola in the search portal of the National Library of Finland.
Data downloads
MARC records
APIs

Z39.50 and SRU

OAI-PMH

License
CC0

Vocabularies

Allärs - Allmän tesaurus på svenska

Allärs is the Swedish translation of YSA - General Finnish Thesaurus.

Description

The contents of Allärs increase as the contents of YSA increase. The thesaurus has a link to the corresponding term in the Finnish YSA thesaurus and the corresponding concept in the ALLFO ontology. Åbo Akademi University Library together with the National Library are responsible for maintaining Allärs.

User interface

Finto/Allärs

Data downloads

RDF serializations: Turtle and RDF/XML

APIs
Linked Data, Finto REST API
License
CC0

FGF - Finnish genre and form vocabulary

FGF (in Finnish known as SLM) is a bilingual (Finnish + Swedish) vocabulary for describing genres and forms of literature (both fiction and non-fiction) and music.

User interface

Finto/SLM

Data downloads
RDF serializations: Turtle and RDF/XML
APIs
Linked Data, Finto REST API
License
CC0

Finnish Corporate Names

The Finnish corporate names data set is used by the National Library of Finland in the description of the national bibliography Fennica.

User interface

Finto/CN

Data downloads

RDF serializations: Turtle and RDF/XML

APIs
Linked Data, Finto REST API
License
CC0

ISIL Identifiers of Finnish Libraries

An ISIL identifies a library, an archive, a museum or a related organization, or one of its subordinate units.

Description

Besides these institutions, an ISIL can also be assigned to agencies co-operating or doing business with these organizations, such as suppliers, publishers and government institutions. ISIL is intended to be used as a permanent identifier.

User interface
The ISIL Standard Identifiers of Finnish Libraries
Data downloads
API can be used to retrieve all ISIL identifiers using an empty query string
APIs
ISIL REST API
License
-

KOKO Ontology

KOKO is a collection of Finnish core ontologies, which have been merged together.

Description

The ontologies include the General Finnish Ontology YSO and ontologies that extend and refine YSO such as the Ontology for Museum Domain, the Ontology of Applied Arts, and the Finnish Ontology of Photography.

User interface
Finto/KOKO
Data downloads
RDF serializations: Turtle and RDF/XML
APIs
Linked Data, Finto REST API
License
CC Attribution 3.0 International

Metadata thesaurus

Metadata thesaurus contains terms and expressions required in describing materials. The thesaurus is also suitable for selecting headings displayed in user interfaces.

Description

The thesaurus has been organised according to entity-relationship structure of RDA text and FRBR concept model.

The use of the terms in the thesaurus standardises the vocabulary of descriptive metadata and simultaneously harmonises description in order to promote productiveness.The aim of the thesaurus is also to make it easier to produce metadata by collecting vocabulary and expressions needed in description as well as to clarify concepts related to them.

User interface

Finto/Metadata thesaurus

Data downloads
RDF serializations: Turtle and RDF/XML
APIs
Linked Data, Finto REST API
License
CC0

PLC - Finnish Public Libraries Classification System

The classification system for Finnish public libraries PLC (known by its Finnish acronym YKL) is an adaptation of the Dewey Decimal Classification.

Description
PLC is a decimal classification, used in libraries to place books on shelves based on their principal subject matter.
User interface
Finto/YKL
Data downloads
RDF serializations: Turtle and RDF/XML
APIs
Linked Data, Finto REST API
License
CC0

SEKO - Finnish Medium of Performance Thesaurus

SEKO is a thesaurus in Finnish which covers instruments, voices etc. used in the performance of musical works.

Description

SEKO is linked to the  Library of Congress Medium of Performance Thesaurus for Music.

User interface

Finto/SEKO

Data downloads
RDF serializations: Turtle and RDF/XML
APIs
Linked Data, Finto REST API
License
CC0

UDC Summary

The Multilingual Universal Decimal Classification Summary.

Description

UDC Summary (udcS) for short - represents a selection of around 2,600 classes extracted from the UDC Master Reference File (UDC MRF) 2011 which contains over 70,000 classes. The selection comprises main numbers, common auxiliary numbers and special auxiliary numbers and it represents even coverage of all areas of knowledge as contained by the scheme.

The version published in Finto is a trilingual classification extracted from the main UDC Summary SKOS file and contains labels in Finnish, Swedish and English.

User interface
Finto/UDCS
Data downloads
RDF serializations: Turtle and RDF/XML
APIs
Linked Data, Finto REST API
License
CC Attribution-ShareAlike 3.0

YSA - General Finnish Thesaurus

YSA is a general thesaurus in Finnish which covers all fields of research and knowledge, which contains the most common terms and geographical names used in content description.

Description

YSA is a tool for providing index terms for printed and electronic materials as well as subject-based information retrieval. The thesaurus works as a shared language between information depositaries and searchers, which improves the ease of finding information.

User interface

Finto/YSA

Data downloads
RDF serializations: Turtle and RDF/XML
APIs
Linked Data, Finto REST API
License
CC0

YSE - concept suggestions for YSO

YSE is a collection of concepts that have been suggested for inclusion in YSO, but they have not yet been accepted.

User interface

Finto/YSE

User interface for making concept suggestions

Data downloads
RDF serializations: Turtle and RDF/XML
APIs
Linked Data, Finto REST API
License
CC0

YSO - General Finnish Ontology

General Finnish Ontology YSO is a trilingual ontology consisting mainly of general concepts.

Description

YSO has been founded on the basis of concepts in Finnish cultural sphere. As an indexing tool it is best applicable when indexed material is interdiscliplinary and its themes vary to a great extent.

User interface
Finto/YSO
Data downloads
RDF serializations: Turtle and RDF/XML
APIs
Linked Data, Finto REST API
License
CC Attribution 4.0 International

YSO places

YSO places is a multilingual gazetteer.

Description

YSO Places contains administrative and geographical areas, both current and historical, that are selected based on the subject indexing needs of memory organizations. About two thirds of the places are within the current borders of Finland.

User interface
YSO places in Finto
Data downloads
RDF serializations: Turtle and RDF/XML
APIs
Linked Data, Finto REST API
License
CC0

  • No labels