Findata’s ready-made datasets are pre-compiled and pre-processed datasets that are available more quickly, without the need for cost estimates or extraction fees from controllers. Findata is the controller of the ready-made datasets.
Our goal is to offer ready-made datasets on specific themes. At the moment, we provide two options: a dataset collection based on registry data from the FinRegistry research project, and a COVID-19-themed ready-made dataset.
How to apply for a ready-made dataset
You can apply for a ready-made dataset by submitting a data permit application through Findata’s e-service at asiointi.findata.fi.
Each dataset is tailored and pseudonymised separately for every permit holder. Ready-made datasets consist of individual-level data and must be analysed in a secure processing environment that meets the required standards. The primary environment for this is Findata’s Kapseli.
Cost of a ready-made dataset
A data permit for one ready-made dataset costs €300. In addition to the permit fee, Findata charges extraction costs based on the amount of work required.
If you wish to combine a ready-made dataset with other data, the pricing and processing time of a standard data permit will apply.
Pricing is based on the regulation of the Ministry of Social Affairs and Health. Current prices are valid until 31 December 2025.
For full details, see our Pricing page.
Available datasets
FinRegistry-READY-MADE-DATASET
More detailed description available at: Aineistokatalogi.fi
Findata’s FinRegistry ready-made dataset is based on registry data collected in the FinRegistry research project and the research data generated from them.
The dataset includes information from the following sources, insofar as the data is covered by the Act on the Secondary Use of Health and Social Data:
- Digital and Population Data Services Agency (DVV)
- Cancer Registry
- Finnish Centre for Pensions (ETK)
- Kanta Services
- Kela
- Finnish Institute for Health and Welfare (THL)
- Statistics Finland
In total, the dataset comprises over 20 individual data sources and includes data spanning several decades.
The ready-made dataset contains three types of data:
- datasets created in the project, with a completely new file structure,
- datasets modified in the project, with file structures similar to the original datasets and
- datasets covering the original data collected for the project.
Type 3 datasets are only included in the tailored dataset when there is no corresponding modified (type 2) version available.
Findata’s FinRegistry ready-made dataset will be compiled in stages, beginning with type 1 and type 2 datasets and later expanding to include type 3 datasets. The source data for type 3 datasets have already been described in the Data Resources Catalog by the original data controller. However, the data collected for the FinRegistry project typically include fewer variables than the original datasets.
Datasets per controller
Type 1:
- Minimal phenotype, Detailed longitudinal
Type 2:
- Digital and Population Data Services Agency: Pedigree, Relative pairs, Relatives, Marriages, Living history
- Finnish Centre for Pensions: Unpaid periods and benefit periods under VEKL, Pension-insured earnings, Earnings-related pensions
- Kanta Services: Patient Data Repository: Laboratory results
- The Finnish Institute for Health and Welfare: Children born, Vaccinations, Infectious diseases, Malformations, Social assistance, Social welfare
Type 3:
- Finnish Cancer Registry: Cancer
- Statistics Finland: Causes of death
- The Social Insurance Institution of Finland: Dispensed medicines reimbursable under the National Health Insurance scheme, Entitlements to reimbursement of pharmaceutical expenses
- Kanta Services: Kanta Prescription Centre: Prescriptions, Dispensed medicines
Contrary to earlier information, the following source datasets collected by the FinRegistry reserach project are not included in Findata’s FinRegistry ready-made dataset due to their size and structure: Primary health care visits, Health care, Intensive care.
Code lists in English are available in the dataset descriptions in the National Data Catalogue (aineistokatalogi.fi), produced by the FinRegistry research project. Links to these are included in Findata’s dataset descriptions.
COVID-19-ready-made dataset
More detailed description: Aineistokatalogi.fi
The COVID-19 dataset contains data from four controllers: The Finnish Institute of Health and Welfare (THL), Kela/Kanta, Fimea and Statistics Finland. The target group is formed based on THL’s Infectious Disease Register. The data includes people who fell ill with COVID-19 in the HUS area in 2020–2021.
Data contents specific to the controller
- Fimea: information on side effects of corona vaccinations
- THL:
- primary healthcare and specialist healthcare information (Hilmo and Avohilmo registers) on COVID-19 related reception visits and ward treatment periods
- Various background information and more detailed information about COVID-19 from the Infectious Disease Register
- Kela/Kanta: comprehensive COVID-19 vaccination information
- Statistics Finland: cause of death data
Basic information
Findata’s ready-made dataset: COVID-19 | N | % |
---|---|---|
Cohort size | 138 396 | |
Male | 69 843 | 50,47 |
Female | 68 553 | 49,53 |
A diagnosis of COVID-19 in 2020 a | 20 755 | 15,00 |
A diagnosis of COVID-19 in 2021 a | 118 217 | 85,42 |
Those who received a positive diagnosis by age group in 2020 | ||
0–15 | 2 379 | 11,46 |
16+ | 18 376 | 88,54 |
Those who received a positive diagnosis by age group in 2021 | ||
0–15 | 27 040 | 22,87 |
16+ | 91 177 | 77,13 |
Those who died during the follow-up period b | 1 183 | 0,85 |
a Some of the persons included in the material were diagnosed with COVID-19 in both 2020 and 2021.
b All causes of death
More information
