For WizardKim-DPT1
Does big data mean big knowledge? Integration of big data analysis and conceptual model for social commerce research
Xuemei Tian1 • Libo Liu1
Published online: 1 October 2016
� Springer Science+Business Media New York 2016
Abstract The Big Data era has descended on many communities, from govern- ments and e-commerce to health organizations. Information systems designers face
great opportunities and challenges in developing a holistic big data research
approach for the new analytics savvy generation. In addition business intelligence is
largely utilized in the business community and thus can leverage the opportunities
from the abundant data and domain-specific analytics in many critical areas. The
aim of this paper is to assess the relevance of these trends in the current business
context through evidence-based documentation of current and emerging applica-
tions as well as their wider business implications. In this paper, we use BigML to
examine how the two social information channels (i.e., friends-based opinion
leaders-based social information) influence consumer purchase decisions on social
commerce sites. We undertake an empirical study in which we integrate a frame-
work and a theoretical model for big data analysis. We conduct an empirical study
to demonstrate that big data analytics can be successfully combined with a theo-
retical model to produce more robust and effective consumer purchase decisions.
The results offer important and interesting insights into IS research and practice.
Keywords Big data � Business intelligence � Customer knowledge management � Customer purchase behavior
& Xuemei Tian [email protected]
Libo Liu
1 Department of Business Technology and Entrepreneurship, Faculty of Business and Law,
Swinburne University of Technology, Melbourne, Australia
123
Electron Commer Res (2017) 17:169–183
DOI 10.1007/s10660-016-9242-7
1 Introduction
Since the 1960s, the interplay between hardware, software, and communications has
led to previously unforeseen advances in information systems. These developments
have been accompanied by shifts in the relationship between ‘carrier and content’,
and between manifestations of the latter as various data, information, knowledge,
and indeed wisdom [1]. During this time, concerns over data and information
overload increased, along with those focused on related organizational and
managerial challenges [2].
One response to these concerns saw the emergence of sub-disciplines, such as
competitive and business intelligence, and data, information, and knowledge
management [3–5]. Others came from management and organizational theory,
notably the resource-based theory, which emphasized the importance of internal
capabilities and competencies [6, 7], and later, knowledge-based views [8, 9], which
focused on the heterogeneous bundles of intangible resources (i.e., imperfectly
mobile, imperfectly imitable and non-substitutable) as a basis for competitive
advantage and organizational success.
As understanding of the nexus between information technology, information
systems, and business has increased, another core lesson has been learned—that
technology on its own is unlikely to provide the answer to the questions continually
posed by the management [10]. Whether it is relational databases, data mining, or
enterprise systems, the latest ‘big thing’ usually turns out to be much less of a
transformative force when introduced to the market. This is as likely to be true for
the latest such breakthrough, namely Big Data [11]. How to better utilize big data
assets, in addition to business assets, and human capital, to create value has become
a fertile ground for corporate competitive advantage. As big data analysis becomes
the next frontier for advancement of knowledge, innovation, and enhanced decision-
making process, the significance of its impact on society as a whole should not be
underestimated [12].
The aim of this paper is to assess the relevance of these trends in the current
business context through evidence-based documentation of current and emerging
applications as well as their wider business implications. In this paper, we use the
BigML software package to investigate how two social information channels (i.e.,
friends-based and opinion leaders-based social information) influence consumer
purchase decisions on social commerce sites. We undertake an empirical study by
combining a theoretical model with the analysis of big data using BigML. This
methodology demonstrates that big data analytics can be successfully combined
with a theoretical model to produce more robust and effective consumer purchase
decisions. However, for this to happen, big data technologies must be further
developed to cope with the proposed big data analysis framework.
The rest of the paper is organized as follows. In Sect. 2, we provide a review of
the literature related to big data, business intelligence, social information processing
theory and online consumer reviews. In Sect. 3, we identify the research gap based
on the literature review. In Sect, 4, we conduct an empirical study to develop and
explain our methodology and compare the results from different models. In Sect. 5,
170 X. Tian, L. Liu
123
we provide robust checks for our final model and conclude by discussing the wider
implications of our study.
2 Literature review
2.1 Big data and business intelligence
Big data has been labelled as the fifth wave in the technology revolution, after the
mainframe, the PC, the Internet, and Web 1.0 eras, and more recently, mobile and
Web 2.0 eras [13]. It combines an architectural paradigm shift in data movement in
which instead of bringing data for centralized computation, the aim is to push the
computation to distributed locations [14]. Big data analytics have been used to
describe data sets and analytical techniques in applications that are so large (from
terabytes to exabytes) and complex (from sensor to social media data) that they
require advanced and unique data storage, management, analysis, and visualization
technologies [12]. Considering the seemingly limitless spectrum, big data can
utilized for social networks, web server logs, traffic flow sensors, satellite imagery,
broadcast audio streams, banking transactions, MP3 s of rock music, the content of
web pages, scans of government documents, GPS trails, telemetry from automo-
biles, financial markets and so on [15]. Dumbill (2012) explains that although any
data can be regarded as ‘Big’ when size is too large to be handled by conventional
systems, it is also the case that the dramatically increased volume (there is too much
of it), velocity (it is moving too fast), and variety (it is not structured in a useable
way) do not longer fit in the structures of database architectures [12, 15]. More
recently, three more characteristics have been mentioned by other researchers, such
as Daniel [16]: (1) value (a source of competitive advantage), (2) veracity (the
biases, noise and abnormality in data), and (3) verification (refers to data verification
and security). Daniel [16] argues that these three attributes also represent Big Data’s
fundamental properties as they are linked to data accuracy, a concept associated
with the longevity of data and their relevance to analysis outcomes, as well as the
length required to store data in a useful form for appropriate value-added analysis.
Clearly, today more than ever, data is a key resource and corporations are striving
to generate it at innumerable data points to create value and obtain competitive
advantage [17]. As will be seen below, further iterations of the concept are ongoing,
as through a combination of technological change, regulatory processes, and
managerial decision making, businesses are collecting and storing data at an
astonishing rate. In turn this is enabling organizations to employ business
intelligence to make faster and more reliable information-based business decisions
[18].
It is claimed that the advent of big-data technologies has rendered obsolete the
separation of information-centric systems, such as data warehouses (once reserved
for strategic decision support) and those, like enterprise resource systems
(supporting daily operations) [19]. The integrated management of transactional
and operational data, especially in new post-bureaucratic structures promises
improved business value through better-informed decisions, the discovery of hidden
Does big data mean big knowledge? Integration of big data… 171
123
insights and automation of business processes [20]. In addition, the resultant
decisions can be made on the basis of data and rigor rather than gut-feeling [12],
albeit increasingly in situations where big data is placing transformative data
discovery and advanced analytics tools into the hands of customers [21] and where
concerns have already been expressed about risks to the security of proprietary
knowledge [22].
Furthermore, while there is plenty of enthusiasm in business circles for big data,
there are those who argue that there is little special about it, seeing it as merely a
magnification of the features found in knowledge management, and others
predicting that its popularity will eventually wane owing to its complexity and a
shortage of qualified workers [23]. It is also important to balance matters of data
size against the context of its application. A survey conducted jointly by IBM and
Oxford University Business School revealed that 30 % of respondents did not know
what ‘big’ meant for their organizations [24]. Elsewhere it has been observed that
considerations of adequacy and relevance might require ‘not so big data’ [25],
something that might apply to smaller businesses, who are also expected to adopt
these technologies, for example through SaaS or as a component of ERP systems
[26].
An important component of the ongoing case for big data is calls for education
and training of a new breed of statisticians and analysts for the proper use of BI
applications [27], big data scientists operating as multifunctional problem solvers
communicating between different departments [28, 29]. Likewise, the professional
development and career progression of in-house analysts—already familiar with the
organization’s unique business processes and challenges—has also been identified
as a top priority for business executives [24]. This might well extend to the
operational personnel, to reflect their needs for analytical and decision making
competencies consequent upon ongoing systems integration and the breaking down
of traditional organizational hierarchies [19].
As with other such concepts, the meaning and interpretation of business
intelligence (BI) has shifted over the years. At its simplest, BI refers to the ability to
use information to gain a competitive advantage. This includes data on education,
skills, and the past performance of employees to help businesses identify the critical
talent within their organizations and ensure its development and retention [29]. BI is
often referred to as the techniques, technologies, systems, practices, methodologies,
and applications that analyze critical business data to help an organization better
understand its business and respond to the market through timely decisions [30].
Moving beyond BI to the development of more advanced apps and an enhanced
ability to drill down and answer the questions asked by BI engines [31], BI and Big
Data initiatives must advance together, rather than separately on parallel tracks. The
real value lies in integrating the existing BI and analytics capabilities with the new
big data technologies and techniques to focus on how these new capabilities can
augment and extend the existing environment [31].
It is clear that much work remains to be done for the anticipated benefits of big
data to be realized. The remaining issues of clarification around the specific content
of these large datasets (primitive data elements, information, knowledge or meta-
knowledge) and the relationship between terms like data, information, and
172 X. Tian, L. Liu
123
knowledge [25]. There are also risks to do with the misuse of data or with ‘false
discoveries’ in trying to find meaningful needles in massive haystacks of data [11].
Hence, this calls for improvement in such areas as data analysis, acquisition,
extraction and cleaning, integration, representation, and interpretation thorough
integration of systems. More generally, there are matters of the benefits and risks of
big data, both in organizational terms and as regards society as a whole, whether
these relate to potential problems in the areas of privacy, security and ethics, or to
emergence of a new version of the Digital Divide, a digital network divide [1, 32].
Critically, this ‘softer’ dimension to developments in big data extends to behavioral
and cultural issues at organizational level, and would extend to a much wider set of
norms and behaviors.
The promises of Big Data are real, however, there is currently a wide gap
between its potential and realization. Many leading researchers, such as Agrawal
et al. [33] and Zicari [34], emphasize the opportunities and challenges in big data
research and argue that the challenges include not just the obvious issues of scale,
but also heterogeneity, lack of structure, error-handling privacy, timeliness,
provenance, and visualization. These problems impede the progress at all phases
in the pipeline (acquisition/recording, extraction/cleaning annotation, integration/
aggregation/representation, analysis/modeling, and interpretation). Better under-
standing and management of these problems is the key of creating business value
from data [33]. The task of creating real value from data cannot be achieved by
simply by focusing on development of new technologies; it also requires us to
fundamentally rethink how we manage data analysis. The purpose of this paper is to
introduce a concept on how to manage data analysis.
2.2 Social information processing theory and online consumer reviews
The growing availability and popularity of social commerce sites have increased the
importance of online social information as a market force [35]. Previous studies
have revealed an association between eWOM (electronic word-of-mouth) and
product sales/revenue, which is mostly explained through either the awareness or
persuasive effects [36]. Alshibly and Chiong [37] showed that customer empow-
erment can increase e-government success. Opinion-based social information in the
form of eWOM communication can take place through several channels. For
example, consumers can post their opinions, and reviews of products on weblogs,
review websites, and social networking sites. Other studies have found that the
volume of consumer reviews is significantly related with product sales [38, 39].
Nowadays, online consumer reviews are becoming increasingly prevalent in the
majority of online shopping sites. Consumers use them either to find products that
match their preferences or to find information useful for offline purchases.
Typically, online consumer reviews can be deconstructed into several dimensions.
Scholars focusing on these have reported the different influences of such reviews on
the choice and sale of products. For instance, Kostyra et al. [40], focusing on
valence (i.e., average rating), volume (i.e., number of customer ratings) and
variance (i.e., variation in ratings) of online consumer reviews (also see [41]), found
that volume and variance only serve as moderators of the effect of valence. As well
Does big data mean big knowledge? Integration of big data… 173
123
it was revealed that they do not have a direct effect on consumers’ product choice.
Focusing on the volume and valence of online movie reviews, Liu [42] documented
that review volume, rather than review valence, increases both aggregate and
weekly box office revenue. By taking this finding a step further, Duan et al. [43]
proposed a dynamic simultaneous equation system in which the two dimensions
could be considered as both a precursor to, and an outcome of, retail sales. That is,
both a movie’s box office revenue and review valence significantly increase review
volume, which in turn leads to better box office performance. Moreover, as noted
earlier, previous studies concerning the effects of online consumer reviews have
investigated the issues either from the perspective retailers or consumers. In our
study, we focus on the effect valence of review on consumer purchase decision.
3 Research framework
In the foregoing literature review a number of commonalities emerged between
sources. Firstly, researchers examined knowledge contribution in general, with a
special focus on knowledge sharing. Although some studies noted that social
relational activities influence knowledge sharing [44], few studies explicitly
investigated social relational activities. Secondly, most studies relied on subjective
data collected through surveys, case studies, and focus groups to explore how and
why customers participate in online social communities. Straub et al. [45] argued
that actual usage and perceived usage are not always congruent. Instead of using
self-reported contributions, we used field data to measure the actual contributions.
Thirdly, we worked on the premise that extracting useful knowledge from big data
requires not only scalable analytics services, but also theoretical support.
Boyer et al. [46] identified the obstacles to successful BI projects as not as much
technological, as lying within the organization itself. They thus advocated strategic
enterprise BI programs that would enable business users to make informed decisions
based on the collaborative leveraging of enterprise-wide information. This
prediction seems relevant today and to the circumstances of big data, whose
analytics-driven insights must be closely linked to business strategy, easy for end
users to understand, and embedded into organizational processes to enable action at
the right time [47]. The strategic significance of such activities is emphasized by
Elbashir et al. [48] who recognize the critical importance of collaborative
knowledge synergies between senior management, CIOs and IT managers in
facilitating improved decision making across the organization’s value chain. For
present purposes this is a reminder of the importance of the knowledge component,
both in relation to current expectations of big data, and in acknowledgement of the
fact that many organizations are still failing to use available data and knowledge
effectively, let alone being in a position to accommodate the demands of big data.
Admittedly big data really could turn out to be different, but another lesson from
experience is that the more data and information are available, the greater the need
for human judgement in decision making. Even in reporting big data-related
advances in cognitive augmentation and job automation, McGovern [49] concedes
that there are still many tasks where the combination of humans and machines
174 X. Tian, L. Liu
123
produce superior results. As shown in Fig. 1, the availability of data alone does not
equate to value creation [25], and what is often missing is the knowledge to extract
wisdom from it, and to take decisions about what data to keep and what to discard,
and how to store what is kept reliably with the right metadata [50]. Processing data
requires not only the right technologies and analytic tools/techniques, but also
consideration of the wider organizational dimension. This will mark the transfor-
mation from big/massive data to knowledge which can be used for making business
predictions and decisions. This raises questions not only as to the relative value of
big data predictions and knowledge-based decisions, but also as to future
relationships between big data and knowledge management.
In this paper, to support the above arguments, we use two empirical studies to
investigate the influence of social relational activities on customer purchase
decisions. We also compare the findings from two studies which indicate that big
data analysis may not provide the best results without consideration of the
organizational dimension.
4 Empirical study
We conducted an empirical study by combining a big data tool (BigML) and a
conceptual model. We used BigML to examine how the two social information
channels (i.e., friends-based social information and opinion leaders-based social
information) influence consumer purchase decisions on social commerce sites.
Fig. 1 Big data analysis framework
Does big data mean big knowledge? Integration of big data… 175
123
4.1 Data collection
The data for this study was collected from a popular social commerce site in Asia,
which provides a platform for consumers to share their experience related to the
purchased cosmetics, and to interact with other consumers. A consumer in the social
commerce site can rate, for example, a cosmetics product (of a particular brand)
while sharing her or his experience about this product. The consumer can also
follow other consumers whose posts or ratings are useful. Figure 2 shows the social
network structure of ego consumer network in this community. As the provision of
ratings is optional, some consumers choose not to provide ratings about their
products or a particular brand.
In total, the website includes 10,097 products from 60 different brands. All of
brands were used for analysis in this study. Moreover, there are 163,845 consumers
within sixty brands. We crawled panel data from this site in Nov 2012. Based on the
customers ID, we crawled the network data for each customer from the community.
Specifically, we collected the ‘‘following’’ and ‘‘follower’’ of customers’ lists and
built an egocentric network for each customer. Then we crawled friends-based
social information (i.e., friends’ review valence and friends’ purchase) and opinion
leaders-based social information (i.e., opinion leaders’ review valence and opinion
leaders’ purchase). Table 1 presents the summary statistics of the variables.
4.2 Operationalization of variables
Opinion leaders’ review valence is operationalized as the total score of ratings (on
products in a particular brand) provided by individuals who are the members’
reference persons.
Friends’ review valence is operationalized as the total score of ratings provided
by individuals who are friend of a consumer in the social commerce site.
Opinion leaders’ purchase is operationalized as the total number of products in
the buy-lists of a consumer’s reference persons in the social commerce site.
Fig. 2 Social network of ego customers
176 X. Tian, L. Liu
123
Friends’ purchase is operationalized as the total number of products in the buy-
lists of a consumer’s friends in the social commerce site.
Consumer purchase behavior is operationalized as the number of products in a
consumer’s buy-list. Such operationalization was considered because consumers
can add products to their buy-lists to indicate that they have already bought specific
products in the social commerce site.
4.3 Data analysis tool: BigML
BigML (https://bigml.com) is an approach to machine learning. Users can set up
data sources, create, visualize and share prediction models, and use models to
generate predictions. It implements a set of basic data mining procedures. BigML
includes several advanced procedures that support interactive analysis of big data.
For example, data histograms are first rendered on a data sample and then updated
when the entire data set is loaded and processed. Decision trees provide a supervised
machine learning technique that starts with the root classification node, which is
then iteratively refined. Decision trees in BigML are rendered on the client appli-
cation dynamically, and evolve as their nodes are induced on the computational
server. More important, the process of preparing a model in BigML can be divided
into several consecutive steps: (1) data preparation, (2) association model learning,
(3) model selection, (4) model testing, and (5) development of model.
4.4 Phase 1: BigML analysis: base model
To analyze our data in BigML, we set consumer purchase decisions as our objective.
We first used BigML to run the selected dataset with the Base Model (1 Click
Model). The results of the decision tree are shown in Fig. 3. This indicates how
important each factor is in the prediction of consumer purchase decisions. Friends’
purchase is the most important factor, which is followed by opinion leaders’ review
and opinion leaders’ purchase. The result of the BigML analysis ran with the
Default Model is not consistent with the regression analysis. This means that the
Default Model does not fit the business rules well, which are artifacts of the data.
BigML’s Default Model does not fit the business rules, and therefore we should
adjust the training model by integrating a conceptual framework from the existing
literature.
Table 1 Descriptive statistics Mean SD
Friends 6.92 8.37
Opinion leaders 4.07 17.66
Friends’ review valence 5.99 16.50
Friends’ purchase 1.53 8.05
Opinion leaders’ review valence 5.05 29.56
Opinion leaders’ purchase 1.07 14.61
Does big data mean big knowledge? Integration of big data… 177
123
4.5 Phase 2 BigML analysis: integrating the conceptual model into the base model
4.5.1 Conceptual research model
Based on social information processing theory, we propose a conceptual research
model shown in Fig. 4, which should be integrated with the default model of
BigML.
Consequently we developed the following hypotheses:
H1a Friends’ review valence will positively influence ego-consumer purchase behavior.
Fig. 3 Results from training data with the default model
Fig. 4 Research model
178 X. Tian, L. Liu
123
H1b Friends’ purchase will positively influence ego-consumer purchase behavior.
H2a Opinion leaders’ review valence will positively influence ego-consumer purchase behavior.
H2b Opinion leaders’ purchase will positively influence ego-consumer purchase behavior.
Based on the conceptual framework, we conducted the following steps to adjust
the model in BigML: (1) association model learning; (2) model testing; and (3)
development of model of purchase decision marking. The results from running the
data set with the adjusted model are shown in Fig. 5. The results indicate that the
adjusted model fits the theoretical model better and therefore provides more
accurate results concerning the antecedents of consumers’ purchase decisions.
4.6 Robust check
Consumer purchase decision is represented by using a count variable summing up
all the products that are added to a buy-list. Negative binomial regression is often
used to analyze count data. Thus we use negative binomial regression to analyze the
dataset. To test the robustness of the adjusted model from BigML, we ran a negative
binomial regression analysis for the model.
Consumer Purchase Decision ¼ b0 þ b1Opinionleader Review þ b2Friend Review þ b3Opinionleader Purchase þ b4Friend Purchaseþ w ð1Þ
The regression analysis results are summarized in Table 2. The results indicate
that friends’ reviews valence have a significant positive effect (b = 0.012, p\ 0.001) on consumers’ purchase decisions. The regression results suggest that
Fig. 5 BigML analysis model 2
Does big data mean big knowledge? Integration of big data… 179
123
opinion review also has a significant positive effect (b = 0.051, p\ 0.001) on consumers’ purchase decisions. Table 2 reveals that opinion leaders’ review valence
exerts a significant positive moderating effect on consumers’ purchase decisions
(b = 0.006, p\ 0.001). The positive coefficient (b = 0.035, p\ 0.001) indicates that opinion leaders’ purchase exerts a significant positive effect on consumers’
purchase decisions. Furthermore, the results are consistent with the adjusted model
of BigML. Therefore, our model developed in BigML is robust.
5 Discussion and conclusion
5.1 Research implications
Recently, the Big Data era has descended on many communities, from governments
to e-commerce to health organizations. To avoid being data driven, the big data
analysis process should be adopted to leverage the opportunities presented by the
abundant data and domain-specific analytics. In addition, while machine learning
algorithms exist, most of them produce ‘‘black box’’ models, which are difficult to
understand. This study presents a complete implementation of a big data analysis
process by integrating machine learning with a theory based on business rules from
the existing literature. Instead of developing machine learning models solely on
specific training datasets, researchers should integrate theory based business rules to
adjust the machine learning models and use test datasets to test the adjusted machine
learning models in order to attain comprehensive models that are more relevant and
better reflect the reality (see Fig. 6).
The literature of big data emphasises the application of algorithms to pattern
analysis and prediction. We have created a big data processing model to emphasise
that processing data requires not only the right technologies and analytic tools/
techniques, but also consideration of the wider organizational dimension. This will
drive the transformation from big/massive data to knowledge to business value
processes (as shown in Fig. 5 below). This raises questions not only as to the
relative value of big data predictions and knowledge-based decisions, but also as to
future relationships between big data and knowledge management. Big Data is still
in its infancy and is quintessentially a technology or set of still developing
Table 2 Results of regression analysis
Model Unstandardized coefficients Sig.
B Std. error
(Constant) .364 .004 .000
Friends’ review valence .012 .000 .000
Friends’ purchase .051 .001 .000
Opinion leaders’ review valence .006 .000 .000
Opinion leaders’ purchase .035 .001 .000
180 X. Tian, L. Liu
123
technologies. Experience has shown time and again that the contribution of new
technologies is dependant very much on organizational and especially, human
factors.
5.2 Conclusion
Making sound business decisions based on accurate and current information and
knowledge requires more than simple intuition, and BI has become indispensable to
organizational success in the global economy. We need not only to be skilled
engineers, but also domain experts. Big data will become big knowledge through the
combination of data mining of association rules and the conceptual model of
business rules. Data mining and domain experts will be able to use this for
extending the possibilities of the model not only for user-defined business rules, but
also for models generated from association rules gained from machine learning.
This paper started from the perception that big data and business intelligence,
while facing challenges in developing systems for a new data savvy generation, had
the potential to leverage opportunities presented by the abundant data and domain-
specific analytics needed in many critical areas.
This study assessed the relevance of these trends in the current business context
through evidence-based documentation of current and emerging applications as well
as their wider business implications. We used the comprehensive model after
verifying that the machine learning model was inaccurate and thus unsuitable to the
task in hand. The proposed framework was tested using in an empirical study the
results of which suggest that big data can become big knowledge by the
combination of data mining of association rules and a conceptual model of business
rules. This big data analysis framework and theoretical model contributes to the
existing literature and offers important and interesting insights to IS research and
practice.
Fig. 6 Big data analysis process
Does big data mean big knowledge? Integration of big data… 181
123
References
1. Buchholtz, S., Bukowski, M., & Sniegocki, A. (2014). Big and open data in Europe: A growth engine
or a missed opportunity. Warsaw, demosEuropa und Warsaw Institute for Economic Studies.
Retrieved from http://www.bigopendata.eu/wp-content/uploads/2014/01/bodeurope2020fullreportsingle
page.pdf.
2. Reich, B. H., & Benbasat, I. (2000). Factors that influence the social dimension of alignment between
business and information technology objectives. MIS Quarterly, 24(1), 81–113.
3. Biere, M. (2003). Business intelligence for the enterprise. Upper Saddle River, NJ: Prentice Hall
Professional.
4. Williams, S., & Williams, N. (2010). The profit impact of business intelligence. Burlington, NJ:
Morgan Kaufmann.
5. Wixom, B. H., Watson, H. J., & Werner, T. (2011). Developing an enterprise business intelligence
capability: The Norfolk Southern Journey. MIS Quarterly Executive, 10(2), 61–71.
6. Wernerfelt, B. (1984). A resource-based view of the firm. Strategic Management Journal, 5(2),
171–180.
7. Barney, J. B. (2001). Resource-based theories of competitive advantage: A ten-year retrospective on
the resource-based view. Journal of Management, 27(6), 643–650.
8. Eisenhardt, K. M., & Santos, F. M. (2002). Knowledge-based view: A new theory of strategy.
Handbook of strategy and management (Vol. 1, pp. 139–164). London: Sage.
9. Spender, J. C., & Grant, R. M. (1996). Knowledge and the firm: Overview. Strategic Management
Journal, 17(S2), 5–9.
10. Reynolds, P., & Yetton, P. (2015). Aligning business and IT strategies in multi-business organiza-
tions. Journal of Information Technology, 30, 101–118.
11. Lohr, S. (2012). The age of big data. New York: New York Times.
12. McAfee, A., Brynjolfsson, E., Davenport, T. H., Patil, D., & Barton, D. (2012). Big data. The
Management Revolution. Harvard Business Review, 90(10), 61–67.
13. Freeland, C. (2012). Big data’s big impact. Accessed January 12.
14. Wixom, B., Ariyachandra, T., Douglas, D., Goul, M., Gupta, B., Iyer, L., et al. (2014). The current
state of business intelligence in academia: The arrival of big data. Communications of the Association
for Information Systems, 34(1), 1.
15. Dumbill, E. (2012). What is big data? An introduction to the big data landscape. Sebastopol, CA:
O’Reilly Media Inc.
16. Daniel, B. (2014). Big data and analytics in higher education: Opportunities and challenges. British
Journal of Educational Technology. doi:10.1111/bjet.12230.
17. Batra, S. (2014). Big data analytics and its reflections on the DIKW hierarchy. Review of Man-
agement, 4(1/2), 5–17.
18. Kapoor, B. (2010). Business intelligence and its use for humans. The Journal of Human Resource and
Adult Learning, 6(2), 21–30.
19. Berner, M., Graupner, E., & Maedche, A. (2014). The information panopticon in the big data era.
Journal of Organization Design, 3(1), 14–19.
20. Laney, D., Hung, L. H., & Lapkin, A. (2013). Big data means big business. Financial Times.
21. Fast Company (2014). The world’s top 10 most innovative companies in Big Data. https://www.
fastcompany.com/most-innovative-companies/2014/industry/big-data.
22. Wagner, D., Vollmar, G., & Wagner, H.-T. (2014). The impact of information technology on
knowledge creation: An affordance approach to social media. Journal of Enterprise Information
Management, 27(1), 31–44.
23. Boulton, C. (2013). Much Hadoop about nothing. Wall Street Journal, 27(1), 31–44.
24. Schroeck, M., Shockley, R., Smart, J., Romero-Morales, D., & Tufano, P. (2012). Analytics: The
real-world use of big data. Somers, NY: IBM Global Business Services.
25. Zhang, D. (2013). Granularities and inconsistencies in big data analysis. International Journal of
Software Engineering and Knowledge Engineering, 23(06), 887–893.
26. Economist. (2010, Febraury 25). A different game: Information is transforming traditional busi-
nesses. The Economist.
27. Lund, S. (2013). Game changers: Five opportunities for US growth and renewal. New York:
McKinsey.
182 X. Tian, L. Liu
123
28. Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C., et al. (2011). Big data: The
next frontier for innovation, competition, and productivity. New York: McKinsey Global Institute.
29. Davenport, T. H. (2006). Competing on analytics. Harvard Business Review, 84(1), 98.
30. Chen, H., Chiang, R. H., & Storey, V. C. (2012). Business intelligence and analytics: From big data
to big impact. MIS Quarterly, 36(4), 1165–1188.
31. Bennett, M. (2014). Leverage the convergence of BI and Big Data in 2015 or miss out. Edmonds,
WA: Martha Bennett’s Blog.
32. McNeely, C. L., & Hahm, J. O. (2014). The big (data) bang: Policy, prospects, and challenges.
Review of Policy Research, 31(4), 304–310.
33. Agrawal, D., Bernstein, P., Bertino, E., Davidson, S., Dayal, U., Franklin, M., et al. (2012). Chal-
lenges and opportunities with big data A community white paper developed by leading researchers
across the United States. Washington, D.C.: Computing Research Association.
34. Zicari, R. V. (2014). Big Data: Challenges and opportunities. In R. Akerkar (Ed.), Big data com-
puting (pp. 103–128). New York: CRC Press, Taylor & Francis Group.
35. Godes, D., Mayzlin, D., Chen, Y., Das, S., Dellarocas, C., Pfeiffer, B., et al. (2005). The firm’s
management of social interactions. Marketing Letters, 16(3–4), 415–428.
36. Dellarocas, C., Zhang, X. M., & Awad, N. F. (2007). Exploring the value of online product reviews in
forecasting sales: The case of motion pictures. Journal of Interactive marketing, 21(4), 23–45.
37. Alshibly, H., & Chiong, R. (2015). Customer empowerment: Does it influence electronic government
success? A citizen-centric perspective. Electronic Commerce Research and Applications, 14,
393–404.
38. Davis, A., & Khazanchi, D. (2008). An empirical study of online word of mouth as a predictor for
multi-product category e-commerce sales. Electronic Markets, 18(2), 130–141.
39. Duan, W., Gu, B., & Whinston, A. B. (2009). Informational cascades and software adoption on the
internet: An empirical investigation. MIS Quarterly, 33(1), 23–48.
40. Kostyra, D. S., Reiner, J., Natter, M., & Klapper, D. (2015). Decomposing the effects of online
customer reviews on brand, price, and product attributes. International Journal of Research in
Marketing, 33, 11–26.
41. Chintagunta, P. K., Gopinath, S., & Venkataraman, S. (2010). The effects of online user reviews on
movie box office performance: Accounting for sequential rollout and aggregation across local
markets. Marketing Science, 29(5), 944–957.
42. Liu, Y. (2006). Word of mouth for movies: Its dynamics and impact on box office revenue. Journal of
Marketing, 70(3), 74–89.
43. Duan, W., Gu, B., & Whinston, A. B. (2008). Do online reviews matter?—An empirical investigation
of panel data. Decision Support Systems, 45(4), 1007–1016.
44. Liang, T.-P., Ho, Y.-T., Li, Y.-W., & Turban, E. (2011). What drives social commerce: The role of
social support and relationship quality. International Journal of Electronic Commerce, 16(2), 69–90.
45. Straub, D., Limayem, M., & Karahanna-Evaristo, E. (1995). Measuring system usage: Implications
for IS theory testing. Management Science, 41(8), 1328–1342.
46. Boyer, J., Frank, B., Green, B., Harris, T., & Van De Vanter, K. (2010). A practical guide for
achieving BI excellence. New York: IBM Corporation.
47. Balboni, F., & Cook, S. (2012). Analytics in the boardroom: Accelerating competitive advantage.
Somers, NY: IBM Institute for Business Value.
48. Elbashir, M. Z., Collier, P. A., Sutton, S. G., Davern, M. J., & Leech, S. A. (2013). Enhancing the
business value of business intelligence: The role of shared knowledge and assimilation. Journal of
Information Systems, 27(2), 87–105.
49. McGovern, T. (2015). Big Data Now.
50. Association, C. R. (2012). Challenges and opportunities with big data. http://www.cra.org/ccc/files/
docs/init/bigdatawhitepaper.pdf.
Does big data mean big knowledge? Integration of big data… 183
123
Reproduced with permission of the copyright owner. Further reproduction prohibited without permission.
- c.10660_2016_Article_9242.pdf
- Does big data mean big knowledge? Integration of big data analysis and conceptual model for social commerce research
- Abstract
- Introduction
- Literature review
- Big data and business intelligence
- Social information processing theory and online consumer reviews
- Research framework
- Empirical study
- Data collection
- Operationalization of variables
- Data analysis tool: BigML
- Phase 1: BigML analysis: base model
- Phase 2 BigML analysis: integrating the conceptual model into the base model
- Conceptual research model
- Robust check
- Discussion and conclusion
- Research implications
- Conclusion
- References