Discussion

profileStudyMonster

 

Question:

Big Data consists of heterogeneous datasets from many sources and the datasets need to be reduced to the same format.

for systems interoperability. Some of the formatting tools include XML, JSON, AVro and Parquet.

Discus the roles XML and JSON, which are the two most popular data formatting tools, in Big Data standardization.


Discussion Requirements


Discuss the need for Big Data standardization.


List the various tools that can be used to achieve Big Data Standardization


What is XML? What is JSON?


Discuss the roles of XML and JSON in Big Data formatting.


What is Avro?


Useful links:

XML

https://www.slideshare.net/Hadoop_Summit/bose-june26-405pmroom230cv3-24148869

Jason

https://engineering.creditkarma.com/json-and-the-confusion-of-formats-in-big-data/

AVro

https://avro.apache.org/docs/current/

  • 7 years ago
  • 5
Answer(1)

Purchase the answer to view it

blurred-text
NOT RATED
  • attachment
    DataStandardization.docx