discussion

profilestudy098

 

Diverse structured, unstructured, and semi-structured data generated from various sources need to be reduced to the same standard so that the data are understandable and can flow among the diverse systems involved in processing them.

Question:

Big Data consists of heterogeneous datasets from many sources, and the datasets need to be reduced to the same format for systems interoperability. Some of the formatting tools include XML, Avro, JSON, and Parquet.

Discuss the roles of XML, Avro, and JSON, which are popular data formatting tools, in Big Data standardization.

Discussion Requirements

Discuss the need for Big Data standardization.

List the various tools that can be used to achieve Big Data standardization.

What is XML? What is Avro? What is JSON?

Discuss the roles of XML, Avro, and JSON in Big Data formatting.
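As a starting point for the discussion, here is a minimal Python sketch of what "reducing data to the same format" can look like. It takes one record and renders it as JSON and as XML using only the standard library, then describes the same record with an Avro schema (Avro schemas are themselves written in JSON). The record, its field names, the schema name "User", and the helper functions to_json, to_xml, and from_xml are all made up for illustration. Actual Avro binary encoding needs a third-party library and is not shown.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical record; the field names are illustrative only.
record = {"id": 1, "name": "Ada", "active": True}


def to_json(rec):
    """Serialize a flat dict to JSON text; JSON keeps numbers and booleans typed."""
    return json.dumps(rec, sort_keys=True)


def to_xml(rec, root_tag="record"):
    """Serialize a flat dict to XML text, one child element per field."""
    root = ET.Element(root_tag)
    for key, value in rec.items():
        child = ET.SubElement(root, key)
        child.text = str(value)  # XML element content is always text
    return ET.tostring(root, encoding="unicode")


def from_xml(text):
    """Parse the XML produced by to_xml back into a dict of strings."""
    root = ET.fromstring(text)
    return {child.tag: child.text for child in root}


# An Avro schema for the same record. The schema is plain JSON; the data
# itself would normally be stored in Avro's compact binary encoding.
AVRO_SCHEMA = {
    "type": "record",
    "name": "User",  # assumed name, for illustration
    "fields": [
        {"name": "id", "type": "long"},
        {"name": "name", "type": "string"},
        {"name": "active", "type": "boolean"},
    ],
}

if __name__ == "__main__":
    print(to_json(record))
    print(to_xml(record))
    print(json.dumps(AVRO_SCHEMA, indent=2))
```

Note one difference the sketch makes visible: the JSON round trip preserves the integer and boolean types, while the XML round trip returns every value as a string unless a schema or extra conversion code is applied. Avro addresses this by always pairing the data with an explicit schema.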

Useful links:

XML

https://www.slideshare.net/Hadoop_Summit/bose-june26-405pmroom230cv3-24148869

JSON Intro

https://www.w3schools.com/js/js_json_intro.asp

JSON & Big Data

https://engineering.creditkarma.com/json-and-the-confusion-of-formats-in-big-data/

Avro

https://avro.apache.org/docs/current/

    • 7 years ago