Final Project

profileJaspereric
Module8ProjectEntry.docx

Module 8 Project Entry

PH 20010 –INTRODUCTION TO PUBLIC HEALTH INFORMATICS

Date: 21th March, 2021

Data Collection and Access

Provide an overview of primary, secondary, and emerging sources that may benefit your theme and the PH agencies and players involved.

Most of the data of Covid-19 are secondary data which is created by the agencies and the Government to access it. Some are open Source’s data and some required licenses to access it. Following are some of the sources of the data used.

Some of the Sources used are:

https://www.ecdc.europa.eu/en/covid-19/data

https://datascience.nih.gov/covid-19-open-access-resources

https://www.cdc.gov/library/researchguides/2019novelcoronavirus/researcharticles.html

This data sources require a user to register on a given platform before accessing them

https://www.kaggle.com/allen-institute-for-ai/CORD-19-research-challenge

https://data.world/covid-19-data-resource-hub/covid-19-activity-location-population-table

Challenges and issues that may be experienced in the matter according to the 3As (e.g., access barriers

One of the main challenges that may be experienced is to access the data as most of the data sources required licensed and some are restricted. Also, biggest problem is most of the data are unstructured and required effort and strategy to clean up the data.

Challenges and issues that may be experienced in the matter according to the 3Ps (e.g., difficulty of binning data)

Fortunately, most of the Covid data are organized but the Problem will be faced in Technical phase of what exactly we required from the data. Also, challenges can be faced in Binning the data as it relies on grouping continuous numeric values into summarizing categories.