research paper1
See discussions, stats, and author profiles for this publication at: https://www.researchgate.net/publication/325594421
A literature review on NoSQL database for big data processing
Article in International Journal of Engineering and Technology · June 2018
DOI: 10.14419/ijet.v7i2.12113
CITATIONS
2 READS
1,117
3 authors:
Some of the authors of this publication are also working on these related projects:
Facial Nerve Assessment View project
Analysis of Student Drop Out Issue in FSIT at DIU using Data Mining Technique View project
Md. Razu Ahmed
Daffodil International University
3 PUBLICATIONS 2 CITATIONS
SEE PROFILE
Md. Asraf Ali
Daffodil International University
42 PUBLICATIONS 147 CITATIONS
SEE PROFILE
Kenneth Sundaraj
Universiti Teknikal Malaysia Melaka (UTeM)
172 PUBLICATIONS 1,147 CITATIONS
SEE PROFILE
All content following this page was uploaded by Md. Razu Ahmed on 06 June 2018.
The user has requested enhancement of the downloaded file.
Copyright © 2018 Md. Razu Ahmed et. al. This is an open access article distributed under the Creative Commons Attribution License, which
permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
International Journal of Engineering & Technology, 7 (2) (2018) 902-906
International Journal of Engineering & Technology
Website: www.sciencepubco.com/index.php/IJET doi: 10.14419/ijet.v7i2.12113
Research paper
A literature review on NoSQL database for big data processing
Md. Razu Ahmed 1, Mst. Arifa Khatun 1, Md. Asraf Ali 1 *, Kenneth Sundaraj 2
1 Department of Software Engineering, Daffodil International University, Dhaka, Bangladesh
2 Faculty of Electronics and Computer Engineering, Universiti Teknikal Malaysia Melaka, Malaysia
*Corresponding author E-mail: [email protected]
Abstract
Objective: Aim of the present study was to literature review on the NoSQL Database for Big Data processing including the structural
issues and the real-time data mining techniques to extract the estimated valuable information.
Methods: We searched the Springer Link and IEEE Xplore online databases for articles published in English language during the last
seven years (between January 2011 and December 2017). We specifically searched for two keywords (“NoSQL” and “Big Data”) to find
the articles. The inclusion criteria were articles on the use of performance comparison on valuable information processing in the field of
Big Data through NoSQL databases.
Results: In the 18 selected articles, this review identified 8 articles which provided various suitable recommendations on NoSQL databases
for specific area focus on the value chain of Big Data, 5 articles described the performance comparison of different NoSQL databases, 2
articles presented the background of basics characteristics data model for NoSQL, 1 article denoted the storage in respect of cloud compu-
ting and 2 articles focused the transactions of NoSQL.
Conclusion: In this literature, we presented the NoSQL databases for Big Data processing including its transactional and structural issues.
Additionally, we highlight research directions and challenges in relation to Big Data processing. Therefore, we believe that the information
contained in this review will incredible support and guide the progress of the Big Data processing.
Keywords: Big Data; Data Processing; Hadoop; Mongo DB; NoSQL.
1. Introduction
We are living an era of data ocean. The past 25 years, Data has
raised in a massive scale in diverse fields including software based
medical rehabilitation system [1] and sports coaching [2]. Accord-
ing to the report of International Data Corporation (IDC), the over-
all created data in the world will reach 44 ZB or trillion gigabytes
during the time of 2013 to 2020 [3]. This vast amount of data refers
to a Big Data that is a global buzzword. However, if we do not pro-
cess Big Data, the outcome of their invisible data would be missed.
It is one of the big problem for Big Data analysis and giant compa-
nies [3]. It cannot be effectively & efficiently managed using tradi-
tional database management tools [4]. To handle this problem, users
have a number of options how to reach the problem related with
such data. For example, to store and process vast scale datasets us-
ers can use various database technology including NoSQL data-
bases [5]. NoSQL originally started off a simple combination of two
words ‘No’ and ‘SQL’ explained as the ellipsis of not only SQL [6].
Hence, NoSQL is a generic term used to refer to any data store or
process that does not follow the traditional model of relational da-
tabase management system [7]. There are 4 basic types of NoSQL
databases includes key-value store, document-based store, and
graph-based store [8].The key-value store NoSQL database basi-
cally, uses a hash table in which there exists a unique key and value
to a specific data. The values are identified and retrieved through a
key, and stored values can be numbers, strings, JSON, XML,
HTML, binaries, images, videos and few others [7]. Document-
Store NoSQL Database, stores each record and data within a single
document. A document-store NoSQL database is used for storing,
recovering, and handling semi-structured data [8]. In column-ori-
ented NoSQL database, data is stored in cells grouped in columns
of data. Columns are logically grouped into column families. A
Graph-based NoSQL database that uses relationships and nodes to
represent and store data. NoSQL database system are emerging be-
side main internet and IT company, such as Google, Amazon, Fa-
cebook, Alibaba, IBM; which company are dealing with huge
amount of data with traditional Relational Database System could
not handle. Therefore, the aim of the study was to help users, espe-
cially to obtain an independent understanding of the strengths and
weakness of NoSQL database approach and where we able to im-
prove for managing huge volume of data.
2. Methods
2.1. Article searching procedure
We used a systematic searching procedure to identify all of the
available articles that discuss the storing and managing data for Big
Data situation using NoSQL databases. In our systematic searching
procedure, we searched two keywords from the Springer Link and
IEEE Xplore digital databases in order to assess the article. Firstly,
we used the keyword “NoSQL” to find journal articles published in
the English language between years 2011 to 2017. We then used the
keyword “Big Data” within obtained set of search results to further
narrow the set of analyzed journal article.
2.2. Article inclusion and exclusion benchmark
International Journal of Engineering & Technology 903
For the final preference of articles that applied the NoSQL database
system for Big Data management. We used some benchmark to in-
clude and exclude articles from the set of articles that were selected
through the search of IEEE Explore and Springer Link online data-
bases. To include and exclude articles from the set of articles found
through our systematic searching technique, we read the title, ab-
stract, methodology and results of each article. We considered only
those articles that were written in English and that used NoSQL Da-
tabases. The exclusion criteria were the following: 1) Article that
applied NoSQL database other than Big Data management, 2) Ar-
ticle that managed Big Data using other than NoSQL database.
2.3. Data extraction
We carefully read and analyzed all of included articles to minutes
the key information. We followed a standard data extraction from
for the particular analysis of each article. Two of the authors of this
study (RA and AK) used our designed standard data extraction form
to track the key information and compared them in order to confirm
the accuracy of the extracted records. Each article was evaluated for
the following key information: (1) Performance comparison for dif-
ferent NoSQL databases for Big Data processing, (2) Overview of
different NoSQL databases and Big Data processing technologies,
(3) Database Engine Ranking, (4) Data Models of NoSQL Data-
bases, (5) Transactions on NoSQL databases.
2.4. Research questions
The final set of articles was used to answer the following questions:
1) Which is the best NoSQL Databases system for Big Data Pro-
cessing? 2) What is the main structural issue in the Big Data pro-
cessing? 3) Are there any transactions possible on NoSQL Data-
bases?
3. Results
3.1. Article search results
We used our systematic article searching procedure and found 18
articles that have published in reputed journals and conferences be-
tween January 2011 and December 2017. We then scanned all the
articles particularly and identified the key points of each. We search
for two keywords (“NoSQL Database” and “Big Data”) to find of
the articles. The search of the Springer Link and IEEE Xplore elec-
tronics databases using this keyword “NoSQL” retrieved 395 and
568 articles respectively.
Fig. 1: Article Search Results.
We then refined search using the keyword “Big Data”, and this
search resulted in the retrieval of 280 and 245 from the Springer
Link and IEEE Xplore databases, particularly. We then read the ti-
tle, abstract, keyword, and methodology of each article, and based
on our inclusion & exclusion criteria, we selected 10 articles from
the IEEE Xplore database and 8 articles from the Springer Link Da-
tabase for analyzing the present study. The article search result is
summarized in Figure 1. Thus, as a result of this searching proce-
dure, a total of 18 articles that discussed NoSQL database for Big
Data management.
3.2. Descriptive analysis
From the collected of 18 articles, we found 8 articles [8-15] that
were related to general overview of NoSQL databases and Big
Data, 5 articles [16-20] that discussed performance comparison and
evaluation
between different NoSQL databases, 2 articles [21-22] that were
associated to NoSQL data model and classifying NoSQL databases
based on the Consistency, Availability, tolerance of network Parti-
tion of database which is known as CAP theorem, [23] this research
has concentrated on the storage aspect of cloud computing systems
using NoSQL. Other
two [24-25] study reported the SQL query condition transformation
to any NoSQL based database using Espresso Heuristic algorithm
and analyzed the effects of transaction on data consistency & effi-
ciency. However, the outcomes of the 18 articles on NoSQL used
in Big Data processing are summarized in Table 1.
904 International Journal of Engineering & Technology
Table 1: Key Points of Each Article
Refer-
ence Approach Study
[8-15] General overview of NoSQL and Big Data This study provides a recommendation on the suitable databases for specific
type of application requirement & focus on the value chain of Big Data.
[16- 20]
Performance comparison and evaluation on different NoSQL Databases.
Discussion on NoSQL best use cases and NoSQL Databases performance meas- urement.
[21-
22]
Classifying NoSQL Databases according to the CAP
theorem and Data Model.
This study describes the background basics characteristics data model of
NoSQL
[23] DB-Engine Ranking This research has concentrated on the storage aspect of cloud computing sys-
tems, in particular, NoSQL Databases
[24- 25]
Transactional for MongoDB, Riak and NoSQL ‘s SQL condition based on Espresso Heuristic algorithm
To analyzed the effects of transaction on data consistency and efficiency and SQL query condition transformation for any NoSQL Databases
3.3. Research question 1: which is the best NoSQL data-
bases system for big data processing?
Two studies [17, 18] provided a recommendation on the suitable
NoSQL databases for specific type of applications. E.Tang et al.
[18] experimented between five NoSQL(Redis, MongoDB, Couch-
base, Cassandra, HBase) databases based on database type and pop-
ularity, and they created a 4-node cluster for each database in ex-
periment (using YCSB = Yahoo! Cloud Serving Benchmark). They
showed loading time while 100000 records load into databases. Re-
dis got top performance which is 1.31 times quicker than MongoDB
. Then, two column-based databases, Hbase and Cassandra, were
1.86 times and 1.71 times lengthier than Redis. The poor perfor-
mance was presented by Couchbase database [18]. And they were
consider workload execution whereas Redis also displayed best per-
formance with the average time execution 1.1 seconds. Other study
[17] described into their article, If data exists in XML format and
need to achieve high level consistency then MongoDB would pro-
vide the best solution, If data are unstructured and need to high per-
formances then Redis would be the best solution, and If processing
data are a high volume of data then Cassandra would be the best
solution.
3.4. Research question 2: what is the structural issue in
the big data processing?
There is some structural issue of Big Data processing. Important
things are how to effectively collect data and store data, and how to
worth for it. These two things are definitely important for Big Data
processing. Based on the above mentioned point, we found 2 stud-
ies [10, 13] that discussed the Big Data issues and challenges. M.
Chen et al. [13] discussed in their survey, The world of Big Data is
heterogeneous and continuously it changing alongside with IoT.
Most of the data are having also wireless data, but there is no per-
manent gateway to collect data from this wireless data. Moreover,
the study [10] noted that Big Data volume grows so large and di-
verse and all data does not need to store for analysis rather how it
is recognized to deal with it.
3.5. Research question 3: are there any transactions pos-
sible on NoSQL databases?
Of the 18 selected studies, two studies [24, 25] focused on the ACID
transactions. The primary models of NoSQL do not support ACID
transaction but Gonzalez-Aparicio et al. [25] developed a new
transaction system and implemented into NoSQL databases for
MongoDB and Riak for ACID transaction; These transaction sys-
tem allows join operation, provide high scalability and concurrency
of transaction. Other study [24] described the SQL syntax convert-
ing into conditional expression of a NoSQL database.
4. Discussion
We studied particularly the selected 18 articles on “NoSQL & Big
Data related technologies” for the analysis of the present processes
to identify future research in the area. The current study presents a
summary of the data found in the literature that focused on perfor-
mance, strengths and weakness of NoSQL databases for Big Data
Processing. We summarized each articles of this paper are pre-
sented as follows:
4.1. Characteristics of NoSQL databases
Most of the traditional database system are based on transactions.
These transactional features are also familiar as ACID (Atomicity,
Consistency, Isolation, Durability) [26]. However, Big transac-
tional process does not work properly with ACID system [27].
Hence, ACID system shown to be a problem in different distributed
systems that are not fully solvable. Therefore, Eric Brewer [28] in-
troduced the CAP theorem (Figure 1) which is more efficient in dif-
ferent distributed systems. But, later the study [26] noted that the
CAP theorem is accepted only two attributes among the three re-
quirements (AP, CP, CA) for Big Data processing at a time. The
more details are following:
• Available and Partition-Tolerant (AP): Achieve "eventual consistency" through reiteration and authentication. Exam-
ple: Voldemort, Couch DB, Cassandra etc.
• Consistent and Partition-Tolerant (CP): CP system have trou- ble with availability whereas keeping data consistent across
partitioned nodes. Example: MongoDB, Redis, BigTable etc.
• Consistent and Available (CA): CA system have problem with partitions and typically deal with replication. Example:
Vertica, MySQL etc.
Fig. 2: Illustration of the CAP Theorem {Source: [29]}.
By the above discussion, it is very important to know about The
CAP theorem for designing any distributed system. For example,
when transactional and ACID issues are coming in NoSQL data-
base, there is no other option without CAP theorem. Later, Gonza-
lez-Aparicio et al [25] developed a new transaction system using
three components: i) Transmission Processing Engine (TPE), ii)
Data Management Store (DMS), and iii) Times Stamp Manager
(TSM). Also, they implemented their developed technique into
NoSQL databases for MongoDB and Riak; where TPE allows join
operation which are not previously supported in NoSQL, whereas
DMS and TSM are applied in order to provide high scalability and
concurrency of transactions. Other study [24] described the Es-
presso heuristic algorithm for converting SQL syntax to a condi-
tional expression of a specific NoSQL Database (MongoDB). How-
ever, some of NoSQL databases support ACID transaction, such as
International Journal of Engineering & Technology 905
Mark Logic [30] is a solution that works like relational and NoSQL
databases.
4.2. Comparison of NoSQL databases
In this present study, we provide evaluation of four categories
NoSQL databases including document based (MongoDB, Couch-
Base), column based (Cassandra, HBase), key value base (Redis)
and graph based (Neoj4). Moreover, MongoDB, CouchBase, Cas-
sandra, HBase, Redis are the most prominent NoSQL databases that
are evaluated using YCSB [16, 18]. Surya et al. [16] showed their
experiment for MongoDB, CouchBase, Cassandra, HBase using
YCSB, where Cassandra had a better performance than others while
the execution of workload is 50% read and 50% write. They also
found that HBase had a better performance with a small datasets of
database, whereas MongoDB had the best performance while work-
ing with 100% read and 100% blind write [16]. EnqingTang et al.
[18] described their experiment for performance measurement be-
tween different NoSQL databases including Radis, MongoDB,
CouchBase using YCSB and noted that Redis is particularly appro-
priate for loading and executing workloads for small datasets, but
Redis have poor performance for the issues of vast amount of da-
tasets. The study [8] worked with various NoSQL databases includ-
ing Redis, MongoDB, CouchBase and noted that Redis is the best
suited for the analysis of small amount of datasets in order to get
high performance, and for processing large amount of data Mon-
goDB is the best choice, whereas CouchBase is better for fault-tol-
erant database environment. However, CoucheBase is applied for
working concurrently read and write operation in large scale data
sets but performance is not good enough, in this case it is recom-
mended to apply Cassandra [17].
4.3. Big data processing using NoSQL
Generally, Big Data is an issue when the size of the dataset crosses
the ability of existing software in order to analyze dataset including
data collecting, processing, retrieving and managing. According to
the study of [31] the Big Data frequently described using 5 Vs( Vol-
ume, Velocity, Variety, Veracity, Value) are following:
• Volume (great volume): The large amount of data sets are created in every second. As a result, data sets are used to in-
crease time over time. This increasingly makes data sets too
large in order to store and analyses for traditional database
technology. But, considering the Big Data technology, we are
able to store and use these kind of large data sets with the
help of distributed systems, where parts of the data is stored
in different locations and brought together by software.
• Velocity (rapid procreation): Velocity refers to the speed at which new data is generated and the speed at which data
moves around. Big data technology allows us to analyze the
data while it is being generated, even without putting it into
databases.
• Variety (various types): With the Big Data technology, we are able to join differed types of data (structured and unstruc-
tured) including messages, social media conversations, pho-
tos, sensor data, video or voice recordings and bring them to-
gether with more traditional, structured data.
• Veracity: It refers to the messiness or constancy of the data. With many forms of big data, quality and accuracy are less
controllable (just think of Twitter posts with hash tags, ab-
breviations, typos and colloquial speech as well as the relia-
bility and accuracy of content) but Big Data and analytics
technology allows us to work with these type of data. The
volumes often make up for the lack of quality or accuracy.
• Value (huge value but very low density): Value is also im- portant to take into account when looking at Big Data. It is
well and good having access to Big Data but unless turns it
into a value otherwise it is useless. It is noted that businesses
make a business case for any attempt to collect and leverage
big data. It is so easy to fall into the buzz trap and embark on
Big Data initiatives without a clear understanding of costs
and benefits.
Moreover, the study [32] added another 2 Vs (Variability,
Visualization) with the above noted 5V’s are following:
• Variability: Variability is different from variety. For exam- ple, a coffee shop may offer 6 different blends of coffee, but
if we get the same blend every day and it tastes different
every day, that is variability. The same is true for data sets, if
the meaning is constantly changing, it can have a huge impact
on data homogenization.
• Visualization: Visualization is complex in today’s world. Us- ing charts and graphs to visualize huge amounts of complex
data is much more effective in conveying meaning than
spreadsheets and reports chock-full of numbers and formulas.
However, the amount of data has been generating more and more
and therefore data sets analysis become more complex. This chal-
lenge is not only collecting and managing vast amount of data, it
also more challenging to extract valuable data [33]. Hence, extract-
ing valuable data is important issue that need to process by four
phases are presented in Figure 4.
Fig. 3: Big Data Life Cycle.
In the generation phase of Big Data processing, various type of raw
data sets are created by different sources. The second phase is ac-
quisition that involve for data collecting and pre-processing in order
to transmit the data sets as the Big Data to the storage phase. Gen-
erally, storage phase is used to store the Big Data using Distributed
file system (DFS). Recently, there are different types of DFS avail-
able for big data processing such as HDFS, GFS, and TFS. Finally,
the Big Data production is the most important for analysis of Big
Data sets which is used to perform through batch-based (Map-Re-
duce), BSP-based, or stream based data processing techniques [34].
However, real-time analysis and small data sets analysis are com-
plex in Hadoop Distributed System. Thus, the real time and non-
files data set analysis is used to perform through NoSQL database.
Hence, to handle the vast amount of data sets, it is require to con-
sider some issues and challenges are following:
• Extremely difficult to effectively collecting and processing of big data.
• If data volume grows so large and diverse, then there is no particular technique to deal with it.
• Slower Decision making, automatic data mining process will be more improved.
• To collecting wireless data, there is no stable gateway. In summary, we highlight research directions and challenges in re-
lation to Big Data processing and storage management system by
NoSQL. Which is emerging impact of Big Data analysis. In NoSQL
databases, transactional issue into NoSQL database and structural
gap between cloud infrastructures can be more improved. Here, we
describe major databases features that require further research in
terms of Big Data management.
5. Conclusion
In this paper, we have described NoSQL database in Big Data tech-
nologies. We categorized different NoSQL databases, particularly
CAP theorem and then we discussed about the Big Data life cycle.
Moreover, we have discussed the strengths, weakness, comparison
and evaluation on various NoSQL databases. The pointing out from
our discussion would be helpful to the business leader for selecting
an appropriate NoSQL database in order to store and manage the
906 International Journal of Engineering & Technology
Big Data. In addition, we have focused research directions and chal-
lenges in relation to the Big Data in storage and management sys-
tem through NoSQL. Although, several techniques have been ap-
plied for Big Data processing through NoSQL, their use for real-
time data mining process and to extract estimated valuable infor-
mation may still be compromised by the factor highlighted in the
present study.
6. Competing interest
The authors declared that they have no competitive interest
References
[1] N. U. Ahamed, K. Sundaraj, R. B. Ahmad, M. Rahman, and A. Ali, A Framework for the Development of Measurement and Quality As-
surance in Software-Based Medical Rehabilitation Systems, Proce-
dia Engineering. 2012; 41: 53–60. https://doi.org/10.1016/j.pro- eng.2012.07.142.
[2] N. M. Khan, Realtime coaching system, U. S Patent No. 8,279,051, 2012.
[3] A. A. Safaei, Real-time processing of streaming big data, Real-Time System. 2017 53(1):1–44. https://doi.org/10.1007/s11241-016-9257-
0. [4] InFocus Blog | Dell EMC Services. Big Data and NoSQL: The Prob-
lem with Relational Databases. https://info-
cus.emc.com/april_reeve/big-data-and-nosql-the-problem-with-re- lational-databases/. Accessed August 7, 2017.
[5] J. Pokorny, New Database Architectures: Steps towards Big Data Processing, Proc. IADIS Eur. Conf. Data Min. (ECDM’13), António Palma dos Reis Ajith P. Abraham Eds., IADIS Press. 2013 3–10.
[6] M. Stonebraker, SQL databases v. NoSQL databases, Commun. ACM. 2010 53(4):10-11. https://doi.org/10.1145/1721654.1721659.
[7] Tech Republic. 10 things you should know about NoSQL databases. http://www.techrepublic.com/blog/10-things/10-things-you-should-
know-about-nosql-databases/. Accessed August 8, 2017.
[8] J. Bhogal and I. Choksi, Handling Big Data Using NoSQL. in Pro- ceedings - IEEE 29th International Conference on Advanced Infor-
mation Networking and Applications Workshops, WAINA 2015, 393–398: IEEE
[9] R. Zafar, E. Yafi, M. F. Zuhairi, and H. Dao, Big Data: The NoSQL and RDBMS review, in ICICTM 2016 - Proceedings of the first In- ternational Conference on Information and Communication Technol-
ogy, 2017; 120–126.
[10] I. Chebbi, W. Boulila, and I. R. Farah, Big Data: Concepts, Chal- lenges and Applications, Springer, Cham, 2015 638–647.
[11] S. D. Kuznetsov and a. V. Poskonin, NoSQL data management sys- tems. Program. Comput. Softw. 2014 40(6): 323–332. https://doi.org/10.1134/S0361768814060152.
[12] V. Rajaraman, Big data analytics, Resonance. 2016 21(8):695–716. [13] M. Chen, S. Mao, and Y. Liu, Big data: A survey, in Mobile Net-
works and Applications, 2014; 19 (2):171-209.
https://doi.org/10.1007/s11036-013-0489-0.
[14] F. Gessert, W. Wingerath, S. Friedrich, and N. Ritter, NoSQL data- base systems: a survey and decision guidance, Comput. Sci. - Res.
Dev. 2017; 32(3–4): 353–365.
[15] L. Wu, L. Yuan, and J. You, Survey of Large-Scale Data Manage- ment Systems for Big Data Applications, J. Comput. Sci. Technol.,
2015; 30(1): 163–183. https://doi.org/10.1007/s11390-015-1511-8.
[16] S. Swaminathan and R. Elmasri, Quantitative analysis of scalable nosql databases, 2016 IEEE International Congress on Big Data
(BigData Congress), 2016: IEEE.
[17] P. Srivastava, S. Goyal, and A. Kumar, Analysis of various NoSql database, 2015 International Conference on Green Computing and
Internet of Things (ICGCIoT), 2015: IEEE. https://doi.org/10.1109/ICGCIoT.2015.7380523.
[18] E. Tang and Y. Fan, Performance Comparison between Five NoSQL Databases, in 2016 seventh International Conference on Cloud Com- puting and Big Data (CCBD), 2016; 105–109: IEEE.
[19] D. Seybold, N. Wagner, B. Erb, and J. Domaschka, Is elasticity of scalable databases a Myth? , in 2016 IEEE International Conference on Big Data (Big Data), 2016 2827–2836: IEEE.
[20] V. N. Gudivada, D. Rao, and V. V. Raghavan, NoSQL Systems for Big Data Management, 2014 IEEE World Congr. Serv., 2014; 190– 197: IEEE. https://doi.org/10.1109/SERVICES.2014.42.
[21] A. Mohan, M. Ebrahimi, S. Lu, and A. Kotov, A NoSQL Data Model for Scalable Big Data Workflow Execution, in 2016 IEEE Interna-
tional Congress on Big Data (BigData Congress), 2016;52–59:IEEE.
[22] J. Han, E. Haihong, G. Le, and J. Du, Survey on NoSQL database, in Proceedings - 2011 6th International Conference on Pervasive Com- puting and Applications, ICPCA 2011, 2011; 363–366:IEEE.
[23] K. Grolinger, W. a Higashino, A. Tiwari, and M. A. Capretz, Data management in cloud environments: NoSQL and NewSQL data stores, J. Cloud Comput. Adv. Syst. Appl., 2013 2, 22.
https://doi.org/10.1186/2192-113X-2-22. [24] Changqing Li and Jianhua Gu, A distinctive transformation approach
of NoSQL’s SQL conditions based on Espresso, in 2017 IEEE 2nd
International Conference on Cloud Computing and Big Data Analy- sis (ICCCBDA), 2017; 61–69: IEEE.
[25] M. T. Gonzalez-Aparicio, A. Ogunyadeka, M. Younas, J. Tuya, and R. Casado,Transaction processing in consistency-aware user’s appli- cations deployed on NoSQL databases, Human-centric Comput. Inf.
Sci., 2017; 7(1): seven.
[26] j. Han, E. Haihong, G. Le, and J. Du, Survey on NoSQL database, in Proceedings - 2011 6th International Conference on Pervasive Com-
puting and Applications, ICPCA 2011, 2011;363–366:IEEE.
[27] A. B. M. Moniruzzaman and S. A. Hossain, NoSQL Database: New Era of Databases for Big data Analytics - Classification, Character-
istics and Comparison, 2013; arXiv preprint arXiv: 1307.0191.
[28] E.A. Brewer. Towards robust distributed systems. InPODC, 2000; seven.
[29] TheTechSolo. CAP Theorem. https://thetechsolo.files.word- press.com/2016/02/captheorem.png?w=640. Accessed September 19, 2017.
[30] Integrate Data Silos with a NoSQL Database | What Is MarkLogic. http://www.marklogic.com/what-is-marklogic/.Accessed:August 8, 2017.
[31] LinkedIn Pulse. B. Marr, Big Data: The five Vs Everyone Must Know, 2014.
[32] Affect Radius.The 7 V’s of Big Data https://www.impactra- dius.com/blog/7-vs-big-data/.Accessed: September 19, 2017.
[33] S. Sagiroglu and D. Sinanc, “Big data: A review, in 2013 Interna- tional Conference on Collaboration Technologies and Systems
(CTS), 2013; 42–47: IEEE.
https://doi.org/10.1109/CTS.2013.6567202. [34] Distributed File System. https://www.slideshare.net/Lycca/distrib-
uted-file-system.Accessed January 3, 2018.
View publication statsView publication stats