Responsible data management emphasises careful planning of the life cycle of data and consideration of the principles of research ethics in the collection, processing and opening of data. Responsible conduct of research applies to all research, and its basic principles are reliability, honesty, respect, and responsibility. In accordance with responsible conduct of research, the acquisition, processing, and storage of research data must meet the criteria for scientific research, and be ethically sustainable. In addition, research conducted in different fields may be regulated by field-specific ethical guidelines, which must always be followed when collecting and processing data.
Examples of field- and subject-specific ethical guidelines and recommendations:
Also, always check what kind of discussion is taking place about research ethics in the context of your own scientific field, and what kind of key sources are cited. In many fields, there is printed literature and online texts on field-specific ethical reflection (e.g. historical research, internet research, children's and youth research, fields of technology, media and communication studies).
Developing a research career requires competence in responsible data management, and internalising ethical principles is part of every researcher's basic skills. Each researcher and members of the research group are responsible for ensuring that both general and field-specific principles of research ethics are followed in conducting research. Responsibility is based on planning and, in the case of data, on careful planning of its life cycle. A data management plan helps in planning and managing the life cycle of data.
Figure: Example of planning the ethical lifecycle of data.
In responsible data management, you can identify research ethical perspectives and process data and research subjects accordingly. Both data management and research ethics should be understood as a cross-cutting theme for the entire research process. It is often difficult to give one correct answer to questions on research ethics, as ethical challenges vary depending on the research question and data.
Research data refers to the data produced, modified, and used in scientific research on which the results of the research are based. Research data also consists of metadata describing the context, datasets, or observational units of the data. The materials, methods, and results used and produced in research, development and innovation (RDI) projects are also research data in this context.
Different data types may have different requirements for data management, which the researcher must be able to understand.
Strong competence in data management skills is utilised to produce high-quality and impactful research in which the processes used, and the outputs obtained are repeatable and transparent. Solid knowledge of data management is important at all stages of a researcher's career.
According to the general principles of research competence defined by the University, the University "places an exceedingly high value on research competence and provides its staff with support and resources for acquiring, assessing, developing and maintaining their research competence throughout their careers." To maintain these skills, each researcher at the University "must ensure that they have a basic level of competence at least in the areas of research ethics, data protection, data management, research methodologies and open science."
To achieve a basic level of competence in data management, a researcher should:
In the later stages of a researcher's career, in addition to the basic level of data management skills, the researcher must master several data management responsibilities related to project management. The good data management skills of the project manager serve as an example for the other researchers participating in the research and guarantee top-level research. Well-managed data management work enables:
The project manager’s data management responsibilities include:
To achieve the above-mentioned learning outcomes related to project management, the principal investigator of the research must also take care of their own competence in data management. The principal investigator shall:
When conducting research, always agree on the rights of use, ownership, division of responsibilities, processing of personal data, and management of sensitive data even before starting data collection. This way you can ensure and clarify your own and other researchers' rights to use the data. If issues related to data rights have not been considered early enough, it may not be possible to share and make the data open access. Agreements regarding data, researchers' rights, responsibilities, and obligations are also part of responsible conduct of research. In addition, research funders may have conditions related to contracts, rights, and ownership in their funding terms.
Agreements regarding data are emphasised in research projects. It is the responsibility of the principal investigator to ensure that the necessary agreements are signed. It is safest to make the agreement in writing and update it if necessary. In this way, researchers' rights of use can be defined and confirmed, and the party that makes the decisions related to the data can be determined.
Always also agree on the authorship of the data in the project. Defining authorship helps downstream users of research data to refer to the author correctly. The production and distribution of datasets is also counted as a scientific merit in the template for researcher's curriculum vitae. The definition of authorship is also central from the perspective of responsible conduct of research. Make sure to agree on the authorship of the research data at the very beginning of the research project. The Finnish National Board on Research Integrity's recommendation on agreeing on authorship for research publications (PDF) can be used as a model.
Questions to consider when coming to an agreement regarding data:
You can get help in drafting research-related agreements from the University's legal services. Agreement templates for more general agreements can be found on the University's intranet. Research projects may include, among other things, the following agreements:
The rights related to data also include deciding on the licence of the published data. Licensing ensures the usability of the data according to clear terms. In accordance with the Open Science Policy of the Tampere University community, machine-readable licences that allow reuse should be favoured in the publication of data. Funders may also have conditions regarding licences for data and metadata. CC licenses require people using your work to credit you as the original author in a manner you want, but your work can be shared and edited according to the conditions you set. The recommended Creative Commons licenses for research data are CC BY and CC0. A CC0 license is recommended for metadata. Read more about choosing a license on the Creative Commons page.
Research use of social media data has increased. The attractive aspects of social media data are the amount and diversity of data, as well as the opportunity to study social media with different methods and perspectives. There is no one right way to study social media data. Social media sets boundaries for the collection and use of data, which must always be considered before data is collected.
When collecting and using social media data, the following points should be considered:
Further reading:
Ahteensuu, Marko (2019), Do you use social media data in your research? Responsible research web pages.
Laaksonen, Salla-Maaria (2018). Expert ethical online research. Responsible research web pages.
Rossi, Arianna (2022). The Hitchhiker's Guide to the Social Media Data Research Galaxy - A Primer. In: Bieker, F., Meyer, J., Pape, S., Schiering, I., Weich, A. (eds) Privacy and Identity Management. Privacy and Identity 2022. IFIP Advances in Information and Communication Technology, vol 671. Springer, Cham. https://doi.org/10.1007/978-3-031-31971-6_6.
Examples of studies, where social media data has been used:
Hiippala, T., Hausmann, A., Tenkanen, H., Toivonen, T., (2019) Exploring the linguistic landscape of geotagged social media content in urban environments, Digital Scholarship in the Humanities, Volume 34, Issue 2, June 2019, 290–309, DOI: https://doi.org/10.1093/llc/fqy049.
Chen, Y., Sherren, K., Smit, M., & Lee, K. Y. (2023). Using social media images as data in social science research. New Media & Society, 25(4), 849-871. DOI: https://doi.org/10.1177/14614448211038761.
Jauho, M., Pääkkönen, J., Isotalo, V., Pöyry, E. & Laaksonen, S-M., (2023) How do trendy diets emerge? An exploratory social media study on the low-carbohydrate diet in Finland, Food, Culture & Society, 26:2, 344-369, DOI: 10.1080/15528014.2021.1971436.
Ohme, J., Araujo, T., Boeschoten, L., Freelon, D., Ram, N., B. Reeves, B., N. Robinson, T., (2023) Digital Trace Data Collection for Social Media Effects Research: APIs, Data Donation, and (Screen) Tracking, Communication Methods and Measures, DOI: 10.1080/19312458.2023.2181319.