Data warehousing, Big Data, Green Computing
8/18/2020 Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReport/ultra?attemptId=d9711c91-a0f2-4b8c-996e-a6e59419a7a… 1/8
%31
%3
%2
%2
SafeAssign Originality Report Summer 2020 - InfoTech Import in Strat Plan (ITS-831-08) - Second Bi… • Week 7 Research Project: Portfolio Project
%37Total Score: Medium risk Ravi Teja Malepati
Submission UUID: f967a088-7285-7553-f8b7-28fe224ebc0b
Total Number of Reports
1 Highest Match
37 % Data Warehouse Architecture, Big Data &…
Average Match
37 % Submitted on
08/18/20 02:56 PM EDT
Average Word Count
2,705 Highest: Data Warehouse Architecture, Bi…
%37Attachment 1
Institutional database (10)
Student paper Student paper Student paper
Student paper Student paper Student paper
Student paper Student paper Student paper
Student paper
Internet (4)
atlantis-press termpaperwarehouse docplayer
gradesfixer
Scholarly journals & publications (2)
ProQuest document ProQuest document
Global database (2)
Student paper Student paper
Top sources (3)
Excluded sources (0)
View Originality Report - Old Design
Word Count: 2,705 Data Warehouse Architecture, Big Data & Green Computing.docx
5 6 2
7 4 1
12 14 3
10
17 9 15
8
11 13
18 16
5 Student paper 6 Student paper 2 Student paper
Running Head: DATA WAREHOUSE ARCHITECTURE, BIG DATA & GREEN COMPUTING 1
Data Warehouse Architecture, Big Data & Green Computing 9
Data Warehouse Architecture, Big Data & Green Computing Ravi Teja Malepati
Information Technology Importance in Strategic Planning
ITS-831-08
University of the Cumberlands
b
1
2
3
4
2
8/18/2020 Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReport/ultra?attemptId=d9711c91-a0f2-4b8c-996e-a6e59419a7a… 2/8
Abstract
Data warehouse is a facilitator for disparate integration of the database which is operational within an enterprise to a store which is single. It also offers the knowledge workers an easy access into summarized, historical as well as various forms of data that is aggregated. The available architectures for warehouse have a flaw which is major during data warehouse de-coupling from its operational databases that are underlying. It causes two complications of complex warehouse update as well as inability of providing the consumers with a warehouse drill-down feature into data that is current. Green Computing is an ecofriendly of computers using with their resources that helps reducing the waste of dumping e-waste and helps in reducing the impact on environment. It helps in conserving low energy to manufacture the computers, while using computers and disposing computers need to be recycled. Every organization need to handle their IT infrastructure in an eco- friendly manner.
Introduction
Data architecture basically comprises of the specifications which are utilized to describe the existing state, define the data requirements, and guide integration
of data in addition to controlling the assets of data as per the strategy of data. It is considered to be of great importance since it acts as a bridge between technical execution as well as business strategy. Data architecture is strategically used in helping business to quickly evolve in addition to benefiting from the techniques which are considered to be emerging. Management of the delivery on complex information as well as data all over the organization plays the role of a functional transformation of agility in addition to change. This brings the necessity of having to analyze as well as evaluate the various types of data analytics which are in use within an organization which are: big data analytics, green data in addition to data warehouse. Green computers main concept is to reduce the conserving
energy and use the available resources efficiently. Every organization should replace their old systems that are conserving more energy and replace with the new energy efficient systems. Organization need to take sustainable measures on moving workload to energy efficient server because using power on the computers, data centers, servers and routers is becoming more. (Keri E Pearlson).
Data warehouse
Data warehouse is basically considered to be a collection of data marts which tends to represent the data which is historical from different functions within an institution. This particular data is mainly kept in an optimized structure to analyze as well as query data in form of a data warehouse. The systems for
warehouse of data like the home designs are made up of various architecture options. Several are deployed with operational data stores while others have the amounts of data. In addition, various data warehouses reference finite sources of data which reference external as well as internal sources of data which are
different (Abderrazak Sebaa, 2018). Additionally, data warehouse can be defined as an organization’s design for collecting data as well as storage. These data
should be sorted out, organized in an adequate manner in addition to being cleaned. Architecture of warehouse of data basically concentrates on getting a method which is efficient to acquire information from the set which is raw as well as having it in a structure which gives insights that are crucial in addition to being easily digestible. Its architecture comprises of the collection of data, used in the intelligence for businesses as well as making of decisions. The data is defined as time
variant, data which is oriented on subject in addition to non-updatable integrated data. The construction of warehouse of data is mainly unique to fit the business use, hence important to have the data architectures ‘common layers examined. The data component includes; source data- it has the responsibility of sourcing
data getting to data warehouses which is capable of being classified into various categories such as; archived data, internal data, and external data in addition to
production data. There is also the component for data staging which happens when the data which is extracted from sources that are different needs to be altered, converted as well as presented through a format which is suitable or querying in addition to analysis. Its basic role is data extraction, transformation of data in
addition to loading data. The operational systems data repositories comprises of data which is current needing to be structured in a processing manner which is
fast as well as efficient. Another component is the metadata which is also considered to be same as dictionary of data or a catalog of data within the database
management system.
5
5
5 6
5
6
5
6
5
6
5
7
Data dictionary includes of logical data structures. It also stores data regarding records as well as addresses in addition to information about indexes. The trends
of data warehouse are several in the digital times. There is the availability of services which are managed; they are basically a form of services which are considered to be of high levels whereby most complications are addressed through the cloud. Most of the encountered challenges are in relation to efficiency, reliability,
performance in addition to scalability which is often managed by providers of cloud services when such services that are managed are in use. In relation to the architecture of warehouse of data, it is possible to utilize the ETK services which are managed fully like the factory for Azure data as well as Amazon, or the data warehouse services that are managed like Amazon AWS, Azure SQL Data warehouse among others. in situations whereby these services are used, it is possible to find interconnectivity services within the cloud, whereby these minimize implementation efforts to a great extent. Provisioning of cloud services in addition to infrastructure templates ease the process of setting up data warehouse solutions. (Rajdeep Chowdhury, 2018). There is the use of data marts for production
lines. In big data repository which is centralized like the warehouse of data it is important to analyze for production lines that are different. Data marts basically offer a solution via containment into the warehouse of data. Another trend is the columnar storage which increases the performance of the disk in comparison to
storage which is row based during the process of retrieving the analytics complex queries. Warehouses of data are capable of offering similar abilities within the cloud like use of amazon redshift for querying as well as storage at a cost which is low. The use of these services minimizes the complexity of have data warehouses set up in addition to providing control of accesses, integrating discourses that are different in addition to others. Big data
It is considered to have brought demands that are varying into organizations. The analytics of big data is a complex process in general which evaluates the varied as well as data sets which are massive or data which is big in having information revealed. Facebook as well as Google are a good example on how management of a strategy on solid data is capable of bringing huge noticeable difference within an organization. It is very essential for institutions to have this measure in consideration since they are required to make use of insights gotten from analyzing big data in relation to decisions which are connected to the operations of the organization. The managers of the IT department are required to be in participation in the efforts of having the extraction of meaning from an organization’s big data. Due to this, the managers for IT have the responsibility of learning about the big data in addition to what is required in assisting the organization make a management strategy which is better. (Zhang, 2018). It comprises of information which is collected from the social media, information which is from devices which are internet enabled including tablets as well as smartphones, machine data, voice in addition to video recordings, as well as continuous logging and preservation of data which is not structured as well as structured. Basically, it is categorized through four Vs. which include; volume; meaning it creates huge data amounts in comparison to traditional sources of data. Variety; its data is from various sources in addition the data is created through machines and people too. Velocity; the speed which data gets generated is very fast whereby it is a continuous process throughout. Veracity; this means that the places where the data gets sourced from are many which makes it essential to have the quality as well as veracity of data tested. Big data is considered to fundamentally alter how enterprises compete as well as operate. Any organization which invests in big data in addition to successfully being able to derive substantial value from their given data have a privilege of having an advantage which is distinct over other competitors who are within the same industry this is due to a gap in performance which will be developing continuously as long as data which is relevant is being generated, techniques which are emerging in addition to channels which are digital provide enhanced acquisition as well as mechanisms for delivery, in addition, the techniques which enable easier as well as faster analysis are still developing Big data analytics concept is something that has been in existence for a long period It
5
5
5
5
8/18/2020 Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReport/ultra?attemptId=d9711c91-a0f2-4b8c-996e-a6e59419a7a… 3/8
Source Matches (41)
techniques which enable easier as well as faster analysis are still developing. Big data analytics concept is something that has been in existence for a long period. It helps companies to harness their data in addition to making use of it in having the new opportunities identified. This later on leads to business moves which are more enhanced, profits which have been escalated, operations which are more effective as well as efficient in addition all these causes the customers to be happy.
The techniques of big data such as analytics which are based in cloud in addition to Hadoop are known for bringing cost benefits which are better in relation to big data storage (Thomas Erl, 2016). The impacts of big data are several. Tools that are new as well as processes are necessary to have big data managed which flows all over the enterprise in addition to processing as well as analysis. The development of new tools is important as it enables users to have the information which is essential without assistance. The components for effectively managing data are basically; security, extraction of information, storage, reconciliation of data as well as distribution of insights all over a company. Green data computing
Green computing is defined as the use of computers in addition to the resources which is environmentally eco-friendly as well as responsible. In addition it is
also considered to be the study of engineering, designing, manufacturing, utilizing in addition to disposing the devices of computing in a manner which basically minimizes their impact to the environment. These practices mainly involve implementing central processing units which are considered to be energy efficient, in addition to peripherals as well as servers and consumption of resource which is minimized in addition to the appropriate electronic waste disposal. Most of the IT manufacturers as well as vendors are continuing to invest in the process of designing computing gadgets which are energy efficient, minimizing the utilization of materials which are dangerous in addition to encouraging recycling of the gadgets which are digital. Its centers are considered important in the process of integrating sustainability measures. Though these centers are known for not generating waste commodities, this sector of high energy in addition to high water is meant to give priority to sustainability programs due to an increase in the demand for sustainability measures, facilities that are ranked efficient as well as energy which is clean. Google, Apple, Facebook are among the huge organizations that have adopted the initiative of going green through various means that are innovative which are later futuristic and constructive. Most managers are undergoing pressure to have their use of electricity minimized within their facilities because of considerations of the environment in addition to cutting down on expenses. The operating expense of a server which uses electricity is a lot in comparison to the expense of purchasing a server. Centers of metadata like Yahoo, Google, as well as Microsoft have had their data moved close to the River of Columbia on the border of Washington and
Oregon so that they can benefit from hydraulic power which is less expensive. Organizations are capable of adopting methods which can cool a climate. A number which is significant in relation to the bills of data power most come from the expenses incurred in having the machines cooled. It is a must for organizations to have their centers for big data cooled through the use of climate which is natural. This is leading to organizations making efforts to locate new centers for their data in the nations with cold weather such as Finland, Ireland as well as Norway. For example, Microsoft is putting into consideration the probability of building centers for data which can be submerged into the sea.in order to save energy which is extra. Efficiency of a server is an additional means of using green data by an organization. Apple is an organization which is known to have integrated fully the green data. This is in relation to its efforts in ensuring climate change has been combated and creating an environment which is better. This particular comprises of the data centers, retail stores, offices in addition to the facilities that are collocated within 43
countries among China, United States as well as United Kingdom. It is committed to producing energy which is 100% clean. The Apple organization in addition to its partners has made efforts to implement the process of the energy which is renewable around the globe. It normally creates energy which is renewable. These
projects depend on sources of energy which are different like rays of solar, wind farms, techniques that are emerging like cells of biogas fuels, systems for micro- hydro generation in addition to technologies for storing energy. Due to these given developments, the company of Apple is a leading example or additional organizations. (Keke Gai, 2016). Conclusion
With the need which is increasing to have organizations maximize their profits, the executives as well as managers are trying to find means of enhancing their bottom line. The success of implementing a data warehouse is measurable after the consulting firm as well as the client executes the project as per the requirements which have been agreed in addition to the scope of the project. As the computers application’s breadth as well as performance is increasing, so is the people’s awareness on the energy’s scarcity as well as cost which is needed in having them powered in addition to the materials required in having them manufactured.
8
9
5
7
5
Nevertheless, due to the fact that developments on computing are capable of enabling people as well as enterprises in adopting lifestyles as well as work styles that are greener, in relation to environmental debate computing is automatically part of the needed solution as well as the available problem.
References
Abderrazak .S, Fatima .C, Amina .N & AbdelKamel .T (2018, February 19). Medical Big Data Warehouse: Architecture and System Design, a Case Study: Improving
Healthcare Resources Distribution. Retrieved from https://doi.org/10.1007/s10916-018-0894-9
Keke .G, Meikang. Q (2016, January). Dynamic energy-aware cloudlet-based mobile cloud computing model for green computing. Retrieved from
https://www.sciencedirect.com/science/article/abs/pii/S108480451500123X
Pearlson, K, Saunders. C, Galletta. D. (2020). Managin and Using information Systems: A Stategic approach, 7th Edition. Hoboken, NJ: John Wiley &
Sons, Inc. ISBN: 978-1119560562. Chowdhury R., Bhattacharya I., De N., Saha S. (2018, July). Double Ended Bottom-Up Approach of Data Warehouse Architecture Employing Country ISO Code Engendered Cryptographic Algorithm. Retrieved from https://doi.org/10.1007/978-981-10-3953-9_14
Thomas .E, Paul .B, Wajid .K (2016, January). Big Data Fundamentals: Concepts, Drivers & Techniques. Retrieved from
https://dl.acm.org/doi/book/10.5555/2898954
Zhang, D. (2018, October). Proceedings of the 8th International Conference on Management and Computer Science (ICMCS 2018). Retrieved from ATLANTIS
PRESS: https://www.atlantis-press.com/proceedings/icmcs-18/25904185
2
10
11 12
4 1 4 4
13
14 15 16
17 14
18
8/18/2020 Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReport/ultra?attemptId=d9711c91-a0f2-4b8c-996e-a6e59419a7a… 4/8
Student paper 100%
Student paper 99%
Student paper 80%
Student paper 100%
Student paper 100%
Student paper 64%
Student paper 88%
Student paper 68%
Student paper 69%
1
Student paper
DATA WAREHOUSE ARCHITECTURE, BIG DATA & GREEN COMPUTING 1
Original source
DATA WAREHOUSE ARCHITECTURE, BIG DATA & GREEN COMPUTING 1
2
Student paper
Data Warehouse Architecture, Big Data & Green Computing 9
Original source
DATA WAREHOUSE ARCHITECTURE, BIG DATA AND GREEN COMPUTING 9
3
Student paper
Data Warehouse Architecture, Big Data & Green Computing Ravi Teja Malepati
Original source
Data Warehouse Architecture, Big Data & Green Computing
4
Student paper
Information Technology Importance in Strategic Planning
Original source
Information Technology Importance in Strategic Planning
2
Student paper
University of the Cumberlands
Original source
University of the Cumberlands
5
Student paper
Data architecture basically comprises of the specifications which are utilized to describe the existing state, define the data requirements, and guide integration of data in addition to controlling the assets of data as per the strategy of data.
Original source
Data architecture has the specifications used in describing the existing state, it defines the requirements of the data, guiding data integration, and also to control data assets according to the data strategy
5
Student paper
big data analytics, green data in addition to data warehouse.
Original source
data warehouse, big data analytics and green data
5
Student paper
This particular data is mainly kept in an optimized structure to analyze as well as query data in form of a data warehouse.
Original source
Data is typically stored in a structure which is optimized to query and analyze data as a warehouse of data
6
Student paper
The systems for warehouse of data like the home designs are made up of various architecture options. Several are deployed with operational data stores while others have the amounts of data.
Original source
The systems of the data warehouse contain various architecture options While others have operational data stores some have installed data amounts
8/18/2020 Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReport/ultra?attemptId=d9711c91-a0f2-4b8c-996e-a6e59419a7a… 5/8
Student paper 69%
Student paper 67%
Student paper 68%
Student paper 62%
Student paper 91%
Student paper 68%
Student paper 64%
Student paper 62%
Student paper 72%
5
Student paper
In addition, various data warehouses reference finite sources of data which reference external as well as internal sources of data which are different (Abderrazak Sebaa, 2018).
Original source
Some data warehouses are known to reference finite data sources, and they reference different external and internal data sources
6
Student paper
Additionally, data warehouse can be defined as an organization’s design for collecting data as well as storage.
Original source
Additionally, data warehouse architecture can be described as an organizational design for collecting and storing data
5
Student paper
The data is defined as time variant, data which is oriented on subject in addition to non-updatable integrated data.
Original source
it's integrated non-updatable, time- variant and subject-oriented data
6
Student paper
The data component includes;
Original source
Data staging component
5
Student paper
archived data, internal data, and external data in addition to production data.
Original source
Production data, internal data, archived data, and external data
6
Student paper
Its basic role is data extraction, transformation of data in addition to loading data.
Original source
The basic functions involved in this process are data extraction, the transformation of data, and loading of data
5
Student paper
The operational systems data repositories comprises of data which is current needing to be structured in a processing manner which is fast as well as efficient.
Original source
the info repositories for operational systems involve this data that need data structured in an efficient and fast processing manner
7
Student paper
Another component is the metadata which is also considered to be same as dictionary of data or a catalog of data within the database management system.
Original source
this is equivalent to the data catalog or data dictionary in the management system database
5
Student paper
Data dictionary includes of logical data structures.
Original source
In data, dictionary data is kept about structures of logical data
8/18/2020 Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReport/ultra?attemptId=d9711c91-a0f2-4b8c-996e-a6e59419a7a… 6/8
Student paper 66%
Student paper 73%
Student paper 65%
gradesfixer 69%
termpaperwarehouse 75%
Student paper 69%
5
Student paper
Most of the encountered challenges are in relation to efficiency, reliability, performance in addition to scalability which is often managed by providers of cloud services when such services that are managed are in use. In relation to the architecture of warehouse of data, it is possible to utilize the ETK services which are managed fully like the factory for Azure data as well as Amazon, or the data warehouse services that are managed like Amazon AWS, Azure SQL Data warehouse among others.
Original source
Many challenges that data warehousing face are associated with reliability, scalability, efficiency, and performance that are managed mostly by cloud provider when using such managed services With regards to the info warehouse architecture, one can use the fully managed ETK services like Azure Data Factory and Amazon), Managed Data Warehouse Services like Azure SQL Data warehouse, Amazon Re-shift et al (Fernando, 2018)
5
Student paper
There is the use of data marts for production lines. In big data repository which is centralized like the warehouse of data it is important to analyze for production lines that are different.
Original source
Thirdly, there are data marts for the lines of production In big centralized data repository like data warehouse, it's critical analyzing for various lines of production
5
Student paper
Another trend is the columnar storage which increases the performance of the disk in comparison to storage which is row based during the process of retrieving the analytics complex queries. Warehouses of data are capable of offering similar abilities within the cloud like use of amazon redshift for querying as well as storage at a cost which is low.
Original source
Lastly, there's the difficulty of the columnar storage that enhances the disk performance compared to row-based storage when retrieving complex queries of analytics Data warehouses can provide such capabilities within the cloud-like using Amazon Redshift both for querying and storage at a lower cost
8
Student paper
Green data computing
Original source
Green Computing."
9
Student paper
Green computing is defined as the use of computers in addition to the resources which is environmentally eco-friendly as well as responsible.
Original source
Green computing is the environmentally responsible and eco-friendly use of computers and their resources
5
Student paper
Centers of metadata like Yahoo, Google, as well as Microsoft have had their data moved close to the River of Columbia on the border of Washington and Oregon so that they can benefit from hydraulic power which is less expensive.
Original source
As such, metadata centers like Google, Yahoo and Microsoft moved their data near Columbia on Washington/Oregon border to learn from a budget hydroelectric power
8/18/2020 Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReport/ultra?attemptId=d9711c91-a0f2-4b8c-996e-a6e59419a7a… 7/8
Student paper 63%
Student paper 71%
Student paper 100%
Student paper 72%
ProQuest document 100%
Student paper 89%
Student paper 67%
Student paper 68%
Student paper 73%
Student paper 100%
7
Student paper
This particular comprises of the data centers, retail stores, offices in addition to the facilities that are collocated within 43 countries among China, United States as well as United Kingdom.
Original source
This achievement entails its data centers, its retail stores, and all its offices in the 43 countries, including China, India, the United States, and the United Kingdom
5
Student paper
These projects depend on sources of energy which are different like rays of solar, wind farms, techniques that are emerging like cells of biogas fuels, systems for micro-hydro generation in addition to technologies for storing energy.
Original source
Such projects are supported different energy sources which include wind farms, solar rays and emerging technologies like micro-hydro generation systems, biogas fuel cells, and energy storage technologies (Apple, 2018)
2
Student paper
Medical Big Data Warehouse: Architecture and System Design, a Case Study: Improving Healthcare Resources Distribution.
Original source
Medical big data warehouse Architecture and system design, a case study Improving healthcare resources distribution
10
Student paper
Retrieved from https://doi.org/10.1007/s10916-018- 0894-9
Original source
https://doi.org/10.1007/s00170-018- 2416-9
11
Student paper
Dynamic energy-aware cloudlet-based mobile cloud computing model for green computing.
Original source
Dynamic Energy-aware Cloudlet-based Mobile Cloud Computing Model for Green Computing
12
Student paper
Retrieved from https://www.sciencedirect.com/science/a rticle/abs/pii/S108480451500123X
Original source
Retrieved from https://www.sciencedirect.com/science/a rticle/abs/pii/S0952197616302251
4
Student paper
Pearlson, K, Saunders.
Original source
Pearlson, K., Saunders, C., Galletta, D
1
Student paper
Managin and Using information Systems:
Original source
Managing and Using Information Systems
4
Student paper
A Stategic approach, 7th Edition.
Original source
A strategic approach, 7th Edition
4
Student paper
John Wiley & Sons, Inc.
Original source
John Wiley & Sons, Inc
8/18/2020 Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReport/ultra?attemptId=d9711c91-a0f2-4b8c-996e-a6e59419a7a… 8/8
ProQuest document 72%
Student paper 100%
docplayer 70%
Student paper 90%
atlantis-press 100%
Student paper 100%
Student paper 91%
13
Student paper
Retrieved from https://doi.org/10.1007/978-981-10-3953- 9_14
Original source
Retrieved from https://doi.org/10.1007/978-981-287-832- 8
14
Student paper
Big Data Fundamentals:
Original source
Big Data Fundamentals
15
Student paper
Concepts, Drivers & Techniques.
Original source
Concepts and Techniques
16
Student paper
Retrieved from https://dl.acm.org/doi/book/10.5555/289 8954
Original source
Retrieved from https://dl.acm.org/doi/book/10.5555/269 2341
17
Student paper
Proceedings of the 8th International Conference on Management and Computer Science (ICMCS 2018).
Original source
Proceedings of the 8th International Conference on Management and Computer Science (ICMCS 2018)
14
Student paper
Retrieved from ATLANTIS PRESS:
Original source
Retrieved from Atlantis Press
18
Student paper
https://www.atlantis- press.com/proceedings/icmcs- 18/25904185
Original source
Retrieved from https://www.atlantis- press.com/proceedings/icmcs- 18/25904185