Annual Report 2023

Towards pan-european regulation

The European Commission’s proposal for a Regulation on the European Health Data Space (EHDS) advanced to trilogue negotiations at the end of 2023. One of the highlights of the past year for Findata and the Finnish research infrastructure as a whole was the Commission’s €2.5 million funding to Findata for the FinHITS project preparing the implementation of the EHDS.

The four-year FinHITS project will provide us with significant additional resources to develop secondary use services and enable Finland to join the European health data space.

International cooperation on secondary use of health data also continued in the TEHDAS project coordinated by Sitra and in the EHDS2 pilot, where we led the development of a pan-European application form.

As in previous years, international interest in Findata was reflected in numerous visits and requests for presentations. Finland is a pioneering country in health data, even by European standards.

The processing of applications accelerated over the last year, and we were able to clear the queues that had built up at the beginning of the year. We continued to improve our efficiency by automating manual steps and streamlining processes.

Johanna Seppänen, PhD, Director

One thousand applications milestone reached in May 2023

In its 3.5 years of operation, Findata has received around 1 200 applications. The number of new applications received last year was slightly up from 2022.

In particular, the demand for statistical data and non-structured textual data increased. In terms of customer segments, the share of private sector applicants increased and now accounts for about one third of all applicants.

The processing of applications accelerated over the last year, and we were able to clear the queues that had built up at the beginning of the year. We continued to improve our efficiency by automating manual steps and streamlining processes.

The number of datasets provided by Findata almost doubled

The number of data sets received and processed by Findata increased compared to previous years. In 2023, Findata delivered a complete set of data for a total of 167 projects.

During the year, we strengthened the resources of our data team through recruitment, the development of AI-based tools and the acquisition of additional expertise from our partner CSC, particularly in processing large volumes of text.

A greater variety of data could be extracted from Kanta services, which will ease the extraction burden on wellbeing services counties in the future.

Closer cooperation with data controllers

We started to visit the new wellbeing services counties that have started their activities. These visits will continue this year. We organised briefings on the secondary use of social and health data and training on data description in the wellbeing services counties. We continued our well-established bilateral meetings with other key data controllers.

Findata’s controller and customer collaboration groups met regularly to discuss common topics such as issues in the wellbeing services counties, application processing, secure processing environments and the EHDS.

We started a reciprocal pilot with data controllers to share the costs of additional work caused by extraction errors. The aim of this so called solidarity model is to reduce the workload caused by extraction errors and to minimise the costs for applicants.

Many thanks to controllers, customers and partners for their good cooperation! We have a strong basis to move forward towards common European practices!

Johanna Seppänen, PhD, Director

2023 in figures

Compared to last year, the number of applications increased by 10 percent and the number of decisions by 24 percent.

296 applications
(270 year 2022)

58% amendment applications
29% data permit applications
13% data requests
351 decisions
(284 year 2022)

84% positive
15% lapsed
1 transfer of administrative matters
Graph: applications received and decisions taken 2021-2023.
2021: 312 applications, 262 decisions
2022: 270 applications, 284 decisions
2023: 296 applications, 351 decisions.
Graph: applications received and decision issued 2021–2023

Processing times for data permit applications continued to vary widely, but the median time for processing dropped from 78 days at the beginning of the year to 72 days at the end of the year. Applications spent about a quarter of their processing time on Findata’s desk last year. The rest of the time the applications were waiting for information from the client or the controllers.

Most applications for amendment permits were processed very quickly, with a median processing time of 3 days.

Decisions by type of permit

Graph: number of decisions by type of permit.
2022: 108 data permits, 132 amendment permits and 2 data request decisions.
2023: 117 data permits, 153 amendment permits and 26 data request decisions.
Graph: number of decisions by type of permit

One data permit covered, on average, data from four different data controllers. One data permit covered in maximum data held by 15 controllers, while the minimum number of controllers included in one data permit was one. In total, we issued data permits for data held by 51 different controllers.

More than a half of the amendment decisions on data permits concerned changing the processors of personal data and just under a fifth concerned the extension of the validity of the permit. Around 15% of amendment decisions concerned the addition of data or an extension of the extraction period while around a tenth of the amendments concerned a change of processing environment. The rest concerned changes to controllers and the transfer of processing outside the EU/EEA countries.

The share of data requests reflects the increase in demand for statistical data. We made a total of 26 data request decisions, compared to only a few in previous years.

The centralisation of THL’s statistical data services in Findata was a major explanatory factor, but data provders serving wellbeing services counties, for example, also made use of the possibility to obtain pre-customised statistics.

The share of decisions subject to appeal continued to drop from 18% to 15%, compared with 33% in our first year of operation in 2020.

Purpose of use of the permits and data requests granted

The vast majority of the permits, 91%, were granted for scientific research.

An increase in the number of data request decisions raised the share of decisions granted for statistical purposes up to 7%.

As in previous years, only a few positive decisions were taken for other purposes.

No data permit or data request decisions were taken for education, guidance and supervision of a social and healthcare authority or knowledge management.

Graph: distribution of uses of permits granted in 2023. 270 permits
(91%) were granted for scientific research, 21 (7%) for statistics, 2 (1%) for development and innovation and 2 (1%) for planning and reporting duties of an authority.
Graph: distribution of the purposes of use of the permits and data requests granted in 2023

See all the permits granted by us and the permit holders here: Permits issued

Applicants and their backgrounds

Last year, the number of amendment applications increased from 47% to 58%, the number of data permit applications fell from 47% to 29% and the number of data request applications increased to 13%.

There were also changes in the applicants’ backgrounds. Applicants from the public sector held the top spot but were increasingly joined by applicants from the private sector.

Graph: background of applicants 2023. 170 applications from the public sector (58%), 105 applications from the private sector (35%) and 21 applications from others (7%)
Graph: Applicants’ backgrounds in 2023

The public sector covered 58% of the applications submitted last year and the private sector 35%.

The ‘others’ category mainly comprises customers from the third sector and those who have requested data as private individuals.

We categorize the background of the applicants according to the main applicant. Some of the permits have been granted to projects or consortia that involve not only the main applicant but also other sectors.

Controllers and organisations associated with the highest numbers of applications

In 2023, the demand was greatest for national registers. About 75 percent of the applications sought data from the registers of the three most popular controllers – Finnish Institute for Health and Welfare (THL), Kela and/or Statistics Finland.

The next most applied were the data of the Digital and Population Information Agency and the Finnish Centre for Pensions. The number of applications for data from Wellbeing Services Counties and Hospital District of Helsinki and Uusimaa was lower than in the previous year.

PopularityControllerNumber of applications 2021Number of applications 2022Number of applications 2023
1.Finnish Institute for Health and Welfare (THL)113110116
2.Kela667661
3.Statistics Finland516445
4.Digital and Population Information Agency (DVV)294225
5.Finnish Centre for Pensions (ETK)151517
6.HUS274316
7.The Wellbeing Services County of Southwest Finland273314
8.The Wellbeing Services County of Pirkanmaa142812
9.Cancer Society of Finland71212
10.The Wellbeing Services County of North Ostrobothnia8239

Data sampling requests and deliveries

678 submitted data sampling requests
(639 in 2022)

13 requests/week
730 data sets received
(582 in 2022)

14 sets/week
766 data sets delivered
(413 in 2022)

15 sets/week
167 data packages delivered
(86 in 2022)

Around 5 sets/package

The number of datasets almost doubled in 2023, when we delivered a dataset for a total of 167 projects. The increase in the quantity of data, combined with the demand for large textual data sets, led to a backlog in data processing in the beginning of the year, which was resolved by the autumn.

Graph: volume of data processing at different stages in 2021-2023. Figures for 2022 and 2023 are shown in the text.
Graph: volume of data processing at different stages 2021-2023

The volume of result data anonymity verifications continued to increase. In 2023, we verified the anonymity of clients’ results a total of 747 times, compared to 416 times in 2022.

In 81% of the verifications, the results were found to be acceptable as they were. In 19% of the cases, we found something to correct in the contents of the results, a slight increase from 13% the previous year. Most typically, the problems were related to the presentation of small frequency data.

As the amount of data sets increased, there were also more extraction errors than in previous years. We are working to reduce and streamline the handling of these errors through a joint pilot between Findata and the controllers and by adding a extraction checkpoint before data is extracted at the request of the applicant or the controller.

We continued to support the data description work by offering training to controllers and by developing tools, the Data editor and the Data catalogue.

Kapseli®

The number of Kapseli processing environments increased by 23 percent and the number of users 29 percent over the past year.

The number of Findata’s secure Kapseli processing environments continued growing.

There were 139 Kapseli’s in use at the end of 2023, with 1 073 registered users. The average number of users per one Kapseli has increased from five to eight in three years.

In addition to the most commonly used software, new services added to Kapseli last year include a Linux operating system and a storage service for storing data and program code.

Graph: number of Kapselis by different machine packages in 2021-2023.
Graph: number of Kapselis by different machine packages in 2021-2023
Graph: evolution of the number of capsule users.
At the end of 2021, 271 users, at the end of 2022, 830 and at the end of 2023, 1073.
Graph: evolution of the number of users registered to Kapseli between 2021 and 2023

Distribution of costs

A total of approximately EUR 2 million was paid for the secondary use of social and health data through Findata in 2023.

EUR 696 000 Findata
Permit decisions EUR 285 000
Data processing EUR 411 000

+ EUR 316 000 compared to 2022
EUR 1 293 000 Controllers
Data extraction costs charged by controllers to customers.

– EUR 69 000 compared to 2022

The figure includes Findata’s decision fees for new data permits, amendment permits and data requests, Findata’s data processing fees and extraction costs charged to customers by data controllers.

On average, 35% of the total invoice charged to customers consisted of Findata’s fees, such as decision fee and the cost of processing the data. The processing costs arise from combining data sets collected from controllers, pseudonymisation or anonymisation, and delivering the data to a secure processing environment.

Correspondingly, on average 65% of the total invoice consisted of data extraction costs charged by controllers to customers.

For the use of Kapseli, Findata’s customers paid a total of €371 000, which is €142 000 more than in 2022.

Findata’s decision and data processing fees and data controller’s extraction costs in 2023

Graph: Findata's permit and processing fees and data charges billed by controllers. Figures reported in the text.
Graph: Findata’s decision and processing fees and data extraction costs charged by controllers

Top 10 most invoiced controllers in 2023

SijaRekisterinpitäjäAmount invoiced (EUR)Invoiced extractions (pcs)
1.Kela332 100116
2.Finnish Institute for Health and Welfare (THL)221 100176
3.Digital and Population Information Agency (DVV)159 70035
4.HUS157 60023
5.The Wellbeing Services County of Southwest Finland125 70021
6.The Wellbeing Services County of Pirkanmaa43 9007
7.The Wellbeing Services County of North Ostrobothnia40 5008
8.Docrates Oy31 1002
9.Statistics Finland26 40065
10.Finnish Centre for Pensions24 70018
Total of 27 other controllers130 20063
Total1 293 000534

Communication and stakeholders

In 2023, we focused on clarity in our communication.

We revamped our customer application clinics and improved the way we communicate about our current queues on our front page. We also redesigned all our application forms and decision documents. All of these changes are designed to support our core mission and make our services easier to use.

We also continued to develop our website. Last year, the Regional State Administrative Agency for Southern Finland carried out a simplified accessibility audit of our website, which we used as a framework to make technical and content improvements to our website’s accessibility.

We simplified our instructions and provided a more comprehensive description of our privacy policy. We also updated the three different language versions of the site to make them more consistent.

Based on numerous requests for visits, speeches and events, Finland continues to be a target of international interest in the secondary use of social data. Cooperation with domestic stakeholders continued to be intensive in working groups set up by the Ministry of Social Affairs and Health and in more informal networks.

Events and presentations

25 events and training
organised by Findata.
~50 presentations
at Finnish and international events organised by others.

Social media channels

1 122 X
+ 26 followers compared to 2022
3 280 LinkedIn
+ 989 followers compared to 2022
1 770 newsletter subscribers
+ 473 subscribers compared to 2022

Highlights of 2023

Findata receives 2.5 million euros in EU funding

26.10.2023
EU is currently working on harmonised legislation on health data. Funding for the promotion of secondary data use was fully granted to Findata. Read more Findata receives 2.5 million euros in EU funding

Findata’s application forms have been renewed

29.05.2023
We have renewed all our application forms during March-May 2023. The goal of the renewal is to make applying easier and minimize additional information requests sent to applicants. Read more Findata’s application forms have been renewed

Reimbursement for Kapseli network drive disruptions

24.05.2023
We at Findata are announcing a compensation plan for the recent Kapseli network drive disturbances that occurred between February and March. Read more Reimbursement for Kapseli network drive disruptions

More than a thousand applications – social and health data have been applied from Findata already 1,008 times

11.05.2023
The number has been accumulated since the start of Findata’s operations in April 2020. In the same period, a total of 868 authority decisions have been made on applications. Read more More than a thousand applications – social and health data have been applied from Findata already 1,008 times

Valvira has transferred its authority to issue data permits to Findata

23.03.2023
From now on, Findata will also process data permit and amendment applications on Valvira’s behalf, that only concern data in Valvira’s registry. Read more Valvira has transferred its authority to issue data permits to Findata

Services

Guidance

We offer general guidance on our services. If you have a question, do not hesitate to contact us! Read more Guidance

Permits and amendment permits

We grant permits for the secondary use of social and health data. Read more Permits and amendment permits

Data requests

We are responsible for all data requests, regardless of whether the request is for data from numerous controllers or a single one. Read more Data requests

Data

We compile and combine data and look after their pseudonymisation or anonymisation. We also support controllers in creating data descriptions. Read more Data

Kapseli®

For data processing, we provide a secure environment named Kapseli, in which key programs required for analysing the data are available. Read more Kapseli®

Do you need social and health data for secondary purposes? See below where to apply for the permits from.

Select the controllers from which the data will be retrieved

Apply permit from the controller in question. The exception is those controllers who have delegated permit jurisdiction to Findata.

Please note that Findata is responsible for data permit and amendment applications whenever the data of data controllers covered by the Act on secondary use is combined. When evaluating the competent authority, all data related to the application under the Act must be taken into account.

Apply permit (s) from the controllers in question.

Findata is responsible for data permits of the Finnish Center for Pensions (ETK) and the Finnish Digital Agency (DVV) and / or Statistics Finland if the data are combined with

  • data of other public organizations under the Act on Secondary Use of Health and Social Data (For Statistics Finland, at least two other organizations are needed, for DVV and ETK, one is sufficient)
  • data stored on Kanta services or
  • to the register data of a private social or health care service provider.

Apply permit from Findata.

Findata is responsible for processing and making decisions concerning data permit and amendment applications, when the application applies to:

  • data from numerous public social and health sector controllers
  • register data from one or numerous private social welfare and health care service organisers, or
  • customer data saved in the Kanta Services.

Apply permit from Findata.

The Regional Administrative Agencies (AVI) have delegated the jurisdiction to Findata.

Apply permit from Findata.

National Supervisory Authority for Welfare and Health Valvira have delegated the jurisdiction to Findata.

Apply for a data permit

Finnish Institute for Health and Welfare (THL) has delegated the jurisdiction to Findata. As far as THL is concerned, the delegation of jurisdiction does not apply to its

  • internal permit management
  • the transfer of samples and data transferred to THL Biobank.

Permit is applied from Statistics Finland and the respective data controller. Exceptions are the registrars who have delegated the jurisdiction to Findata.

We are responsible for data permits for data subject to the Secondary Act of Statistics Finland when they are combined

  • to the information of at least two public organizations covered by secondary laws
  • to data stored in Kanta services or
  • to the register data of a private social or healthcare service organizer.

Apply permit from Findata.

Findata is responsible for processing and making decisions concerning data permit and amendment applications, when the application applies to:

  • data from numerous public social and health sector controllers
  • register data from one or numerous private social welfare and health care service organisers, or
  • customer data saved in the Kanta Services.

Please select at least one data controller or group.