Introduction
Overview
Teaching: 2 min
Exercises: 0 minQuestions
What will we cover this morning?
Objectives
Deep Dive into Data Management
So far we have tackled the general concepts around data management: F.A.I.R concepts and the data life cycle. And where BCO-DMO fits in that picture.
F.A.I.R geared toward data curators rather than individual researchers, and to facilitate machine accessibility of data in addition to human readability. But you should endeavor to use resources that help move your data closer to being FAIR. BCO-DMO engages in all of the FAIR principles.
There are steps individual researchers can take to increase the “fairness” of their data. This will be the focus of today. We will dive deeper into hands-on management of data tables and the general practices of (re)using a data table.
Part 1: Good approaches to Tabular Data Formatting
Spreadsheets are good for data entry. We have a lot of data in spreadsheets. Much of your time as a researcher will be spent in this ‘data wrangling’ stage. It’s not the most fun, but it’s necessary. We’ll teach you how to think about data organization and some practices for more effective data wrangling. After this lesson, you will be able to:
- Implement best practices in data table formatting
- Identify and address common formatting mistakes
- Understand approaches for handling dates in spreadsheets
- Effectively export data from spreadsheet programs
Table of content:
- Tabular Data and Spreadsheets
- Common formatting mistakes & exercise
- Discussion
- Dates are special
- Exporting data
Part 2: Introduction to Data Reuse, Access and Provenance
After this lesson, you will be able to:
- Assess a dataset to see if it is fit for data reuse.
- Learn about different methods for accessing data at BCO-DMO.
- Learn the benefits and drawbacks to analyzing data in spreadsheets.
- Create good records and metadata to help your future self, and enable FAIR data.
- Learn about about other methods and tools for using data outside of spreadsheets.
Table of content:
- Assessing metadata
- Download data
- Data manipulation in spreadsheets
- What is next?
Key Points