Skip to main content

Data management

Data Management is a set of practices and techniques used by researchers to ensure that their data is organised, structured and easily reusable for future research

Resources

  • Git Collaboration

    EN
    This short course will help learners understand how to work with Git in a collaborative setting such as teamwork or group projects, and how to make use of platforms like GitHub or GitLab to complete that work.
  • Data Ethics in Cultural Heritage

    EN
    This resource aims to introduce the main aspects of data ethics in the cultural heritage domain. It also examines how data management can be supported to become more ethical, while also addressing topical discourse about data ethics in the sector. The resource also aims to support in critically reflecting on some case studies with evident digital data ethics considerations.
  • Introduction to Collections as Data

    EN
    The goal of this course is to introduce the Collections as Data principles in the cultural heritage sector to make available a digital collection suitable for computational use. Students will have a fundamental understanding of the complexities of Collections as Data as well as an appreciation of the diversity of the content provided by cultural heritage institutions. This course will be useful for small and medium-sized institutions willing to make available their digital collections suitable for computational use.
  • Scalable Reading of Structured Data

    EN
    In this lesson, you will be introduced to ‘scalable reading’ and how to apply this workflow to your analysis of structured data.
    Authors
    • Max Odsbjerg Pedersen
    • Josephine Møller Jensen
    • Victor Harbo Johnston
    Read more
  • Performing Arts: Transitioning to the Digital Age

    EN
    The conference aimed to examine the possibilities of connecting information sciences and computer science with performing arts, focusing on three thematic blocks: archiving, artistic practices and scholarly research. The international scientific and professional conference is part of the project of the same name by the DARIAH-EU Working Group Theatralia, which is dedicated to the research of digital technology in the performing arts and the digitization of theatralia, financed from DARIAH-EU funds.
    Authors
    • Anamarija Žugić Borić
    • Antonia Hladilo Duspara
    • Koraljka Kuzman Šlogar
    Read more
  • Introduction to Knowledge Organization Systems

    EN
    Knowledge Organization Systems provide the foundation for searching and retrieving information objects across digital collections and cultural heritage information systems. Starting from case studies this course provides an introduction to knowledge organization systems and their practical application in research contexts.
  • Using Spatial Data in Tableau

    EN
    Tableau is a powerful digital tool for analysing data that can help with mapping and interrogating data. In this short guide we will focus on an aspect of data analysis using mapping that has particular application for Holocaust and refugee studies.
  • Entity Matching

    EN
    EHRI (European Holocaust Research Infrastructure) supports the use of digital tools that can assist in the research of Holocaust and refugee related topics. In a continued effort to make these tools as accessible as possible so that researchers who have no experience with digital tools will consider trying new ways of using their data, this GitHub-based lesson showcases the use of entity match tools when dealing with geographic data.
  • What Can I Do With This Messy Spreadsheet? Converting from Excel Sheets to Fully Compliant EAD-XML files

    EN
    Many Galleries, Libraries, Archives, and Museums (GLAMs) face difficulties sharing their collections metadata in standardised and sustainable ways, meaning that staff rely on more familiar general purpose office programs such as spreadsheets. However, while these tools offer a simple approach to data registration and digitisation they don’t allow for more advanced uses. This blogpost from EHRI explains a procedure for producing EAD (Encoded Archival Description) files from an Excel spreadsheet using OpenRefine.
  • Windows Subsystem for Linux (WSL)

    EN
    Many tools and examples that are of interest to those wishing to explore, experiment, and develop projects for digital humanities or data analysis and other tasks are based on a Linux operating system. Mac iOS laptops support Linux fairly easily. However, until recently, Windows OS users have had difficulty in accessing programs and techniques that require a Linux operating system. This short tutorial will demonstrate a simple way for most Windows 10 users to run Linux programs and systems through Windows Subsystem for Linux (WSL).
  • Extracting CSV Data from the EHRI Search API

    EN
    The EHRI (European Holocaust Research Infrastructure) Search API provides a way to retrieve information about items in the EHRI portal in JSON (JavaScript Object Notation) format by making HTTP requests to particular URLs. This short tutorial shows learners how use a command line tool (CuRL) to fetch structured data and transform it into CSV (comma separated values) format for import into a spreadsheet like Excel or Google Docs.
  • Importing tables from websites into spreadsheets

    EN
    Sometimes it can be useful to take information from a website, such as document lists from archives, for future reference. This short resource will show the user how to download an extension to copy tables from websites and then import the table into a spreadsheet program.