Final Project
Module 8 Project Entry
PH 20010 –INTRODUCTION TO PUBLIC HEALTH INFORMATICS
Date: 21th March, 2021
Data Collection and Access
Provide an overview of primary, secondary, and emerging sources that may benefit your theme and the PH agencies and players involved.
Most of the data of Covid-19 are secondary data which is created by the agencies and the Government to access it. Some are open Source’s data and some required licenses to access it. Following are some of the sources of the data used.
Some of the Sources used are:
https://www.ecdc.europa.eu/en/covid-19/data
https://datascience.nih.gov/covid-19-open-access-resources
https://www.cdc.gov/library/researchguides/2019novelcoronavirus/researcharticles.html
This data sources require a user to register on a given platform before accessing them
https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge
https://data.world/covid-19-data-resource-hub/covid-19-activity-location-population-table
Challenges and issues that may be experienced in the matter according to the 3As (e.g., access barriers
One of the main challenges that may be experienced is to access the data as most of the data sources required licensed and some are restricted. Also, biggest problem is most of the data are unstructured and required effort and strategy to clean up the data.
Challenges and issues that may be experienced in the matter according to the 3Ps (e.g., difficulty of binning data)
Fortunately, most of the Covid data are organized but the Problem will be faced in Technical phase of what exactly we required from the data. Also, challenges can be faced in Binning the data as it relies on grouping continuous numeric values into summarizing categories.