Discussion - Data Management
Need comment for this discussion
1. What are the business costs or risks of poor data quality?
At to begin with, data quality was confined to just the CRM systems. This versatile quality now extends past sorted out customer data. To start modifying the data quality, you need to get inside the corner and acknowledge what correctly cause horrendous data:
1. Missing Data: Empty fields that ought to contain information.
2. Improper information: Data that has been entered in the wrong field.
3. Non-adjusting information: Data that hasn't been standardized according to the arrangement of records.
4. Copy information: A solitary Account, Contact, Lead, and so forth that possesses more than one record in the database.
5. Poor information passage: Misspells, errors, transpositions, and varieties in spelling, naming, or arranging.
Reference:
1) Criticality of data quality as exemplified in two disasters by CW Fisher, BR Kingma - Information & Management, 2001
2) Principles of data mining by DJ Hand - Drug safety, 2007
3) Criticality of data quality as exemplified in two disasters by CW Fisher, BR Kingma - Information & Management, 2001
2. What is Data mining?
Data mining is getting data, learning or information from a gigantic accumulation of information. Information can be brought in various examples, substantial lumps, distinctive kinds of sets. It is process where different techniques are connected to acquire required examples of information. Information digging is for the most part improved the situation breaking down information, making connections between the information designs and to take care of issues.
Extraction, change and putting away information in information stockroom is one of the underlying advances associated with Data mining. Information ought to be appropriately overseen in databases. The information that is put away ought to be open. Along these lines, giving access to obtain information is essential. The gained information is then utilized for breaking down and it is introduced in various structures, for example, diagram, charts, histograms and so forth.
Case: Data Mining in Walmart-Walmart stores all the fundamental data that are gathered in its information distribution center. It likewise gives access to the its providers which causes them to recognize diverse buying examples of its clients. Walmart can sort diverse examples into various sets, for example, number of clients purchasing a specific item, the propensity for clients, taste, conduct, request of the item and so on.
Different procedures are required to mine information. These different strategies are received from numerous spaces, for example, Database (DB), Recognition of Pattern, Applications, Data Warehouse, Visualization, Statistics, Algorithms, Machine Learning and so on. Data mining is done through different web crawlers, for example, Google, Yahoo, Bing and so forth these web crawlers additionally utilize different methods to help bring information, for example, legitimate ordering, settling on what pages ought to be shown initially, what ad ought to be shown and at which place of the page.
Data mining is exceptionally valuable till a specific degree however abusing individual data or unveiling any secret or delicate information can be a potential danger for this situation. Every one of these perspectives ought to be thought about.
Reference:
1) Knowledge Management and Data Mining by MJ Shaw, C Subramaniam, GW Tan and ME Welge
2) (Larose, 2014), Discovering Knowledge in Data: An Introduction to Data Mining
3) Computational historiography: Data mining in a century of classics journals by D Mimno - Journal on Computing and Cultural Heritage (JOCCH), 2012
3. What is text mining?
Text mining is tied in with preparing the unstructured type of data, determining a critical numeric incentive from the content and to guaranteeing the data is put away in the content. The data put away in the content, ought to be effectively open and the information can be utilized to investigated. In this way, fundamentally message mining changes content into numbers.
Text mining is extremely valuable and supportive in different areas and parts. In numerous associations, Text mining innovation is utilized to enhance client benefit involvement. It used to expand their pace of reacting back to the clients and furthermore to enhance proficiency. It is utilized as a part of giving mechanized answer back messages to the clients relying upon what questions that have asked in their email.
Text mining is likewise utilized as a part of sifting the spam messages, recognizing any fake action, diminishing dangers in web identified with digital violations. It is likewise utilized for breaking down information identified with online networking. In web-based social networking, there are unstructured information heaped up. To comprehend the necessities of the clients, tastes of the client, request of the item, it is anything but difficult to separate right data through online networking by content mining. It helps in giving sufficient information which can be investigated to bring out better outcomes.
Reference:
1) Linking genes to literature: text mining, information extraction, and retrieval applications for biology by M Krallinger , A Valencia … - Genome …, 2008
2) Text-mining and information-retrieval services for molecular biology by D Delen , MD Crossland - Expert Systems with Applications, 2008
3) The value and benefits of text mining by D McDonald, I McNicoll , G Weir , T Reimer… - JISC Digital 2012