Midterm
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a677… 1/12
31437.202030 - SUMMER 2020 - INTRO TO DATA MINING (ITS-632-10) - SECOND BI-TERM
Midterm Exam Krishna Sai Ravilla on Sun, Jul 26 2020, 11:30 AM
41% highest match Submission ID: a677ee06-0bad-4e9c-91f5-30b55e8b2c2f
Attachments (1)
BusinessIntelligence.docx 1
Running Head: USING RAPID MINER IN BUSINESS INTELLIGENCE
USING RAPID MINER IN BUSINESS INTELLIGENCE
1 USING RAPIDMINER IN BUSINESS INTELLIGENCE
2 KRISHNA SAI RAVILLA
3 07/24/2020
Introduction
RapidMiner is an open-source condition for machine learning and data analytics. It is seriously utilized for academic purposes at colleges just as for mechanical or business applications. The BOINC system likewise stood out as it gives the capacity to effectively arrangement an appropriated registering condition.
1 HISTORY OF THE RAPIDMINER
(http://safeassign.blackboard.com/)
BusinessIntelligence.docx Word Count: 1,361 Attachment ID: 3169682491
41%
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a677… 2/12
4 MACHINE LEARNING, PRESCIENT ANALYTICS, TEXT MINING, BUSINESS ANALYTICS AND DATA MINING ARE REFERRED IN TO AS A RAPID MINER. The RapidMiner venture was begun in 2001 by Ralf Klinkenberg, Ingo Mierswa, and Simon Fischer at the Artificial Intelligence Group of Katharina Morik at the Dortmund University of Technology. In 2007, the task officially known as YALE was renamed and distributed as RapidMiner form 4.0. From that point forward, the product is facilitated by SourceForge and is offered for nothing out of pocket as a Community Edition discharged under the GNU AGPL. There is likewise an Enterprise Edition offered under a business permit for joining into shut source ventures (Hofmann & Klinkenberg, 2016).
The product is written in Java and runs alleged procedures. A procedure is essentially an XML-File produced by the client and contains a grouping of assignments which are spoken to by administrators. More than 500 administrators are as of now remembered for the product. Their usefulness covers the principle parts of data analysis, for example, data stacking and change, data preprocessing and perception, demonstrating and model assessment. By joining these administrators, essential machine learning undertakings, for example, data mining, text mining, time arrangement analysis and determining, web mining just as supposition analysis and conclusion mining can be performed. The product additionally gives various techniques to envisioning high dimensional data sets. Since RapidMiner is written in Java, it is stage autonomous and can be effortlessly joined with other programming devices. Doing as such, the notable WEKA structure was coordinated into RapidMiner.
What's more, RapidMiner gives a heavenly module component, which can be utilized to effectively extended the usefulness of the centre programming. Since 2007, RapidMiner has been vigorously broadened and got one of the most significant data mining and logical data apparatuses. It is seriously utilized in early on courses and academic purposes at colleges everywhere throughout the world. RapidMiner is likewise utilized for everyday purposes by numerous organizations and experts for various applications.
1 REVIEW OF THE DATA
5 VETERAN EMPLOYMENT OUTCOMES (VEO) ARE NEW TRIAL U.S. Registration Bureau measurements on work showcase outcomes for as of late released Army veterans. These measurements are organized by military specialization, administration qualities, manager industry (whenever utilized), and veteran socioeconomics. They are produced by coordinating assistance part data with a national database of occupations,
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a677… 3/12
utilizing best in class classification insurance components to ensure the underlying data (Kotu & Deshpande, 2014).
1 VETERAN EMPLOYMENT OUTCOMES (VEO) IS TRIAL CLASSIFICATIONS CREATED BY THE LONGITUDINAL EMPLOYER-HOUSEHOLD DYNAMICS (LEHD) PROGRAM IN A JOINT EFFORT WITH THE U.S. Armed force and state organizations. With the help of Ranks and Military occupation VEO data provides us with the benefit of employment as a veteran and business functions. VEO is presently discharged as an examination data item in the "test" structure.
1 THE VEO GIVE DATA ON INCOME AND EMPLOYMENT FOR AS OF LATE RELEASED ARMY VETERANS. PROFIT IS ACCESSIBLE AT THE 25TH, 50TH, AND 75TH PERCENTILES, ONE, FIVE, AND TEN YEARS AFTER DETACHMENT FROM DEPLOYMENT-READY ASSISTANCE, BY RANK, OCCUPATION, AND RELEASE PARTNER. With the help of observed tables, the area of employment and the industries for veterans are incorporated. By comparing the data of both veteran records and employment of national database the appropriate measures are taken care
The VEO utilize front line differential security techniques to ensure the secrecy of the primary data, an assurance strategy created in software engineering to bound the protection hazard to people from various inquiries to a similar database. Differential protection strategies permit the Census Bureau to discharge definite organizations on veteran outcomes while limiting the security hazard to people in the data.
1 EXPLORING THE DATA WITH THE TOOL
Decision tree
1 CLASSIFICATIONS ALTERNATIVE TECHNIQUES
Like Random Forest, Gradient Boosting is another strategy for performing administered machine learning assignments, similar to characterization and relapse. The executions of this method can have various names; most generally, you experience Gradient Boosting machines.
Boosting fabricates models from individual purported "feeble students" in an iterative way. In the Random Forests part, I had just talked about the contrasts among Bagging and Boosting as tree group strategies. In boosting, the individual models are not based on totally arbitrary
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a677… 4/12
subsets of data and highlight yet successively by putting more weight on occasions with wrong expectations and big mistakes. The overall thought behind this is occurrences, which are challenging to foresee effectively ("troublesome" cases) will be centred around during learning, with the goal that the model gains from past errors. At the point when we train every troupe on a subset of the preparation set, we additionally call this Stochastic Gradient Boosting, which can help improve the generalizability of our model.
The slope is utilized to limit a misfortune work, like how Neural Nets use inclination plunge to streamline ("learn") loads. In each round of preparing, the feeble student is constructed, and its forecasts are contrasted with the right result that we anticipate. The separation among expectation and truth speaks to the mistake pace of our model. These mistakes would now be able to be utilized to compute the angle. The inclination is not all that much; it is fundamentally the incomplete subsidiary of our misfortune work - so it portrays the steepness of our mistake work. The slope can be utilized to discover the course in which to change the model boundaries to (maximally) diminish the mistake in the following round of preparing by "slipping the inclination"
1 OUTLINE OF RESULTS
Instruction at Enlistment Qualification for Army enrollment relies upon meeting sure training edges. Accordingly, almost all Army administration part records incorporate their training level at the time of enrollment. We utilize Army managerial data to create three classes of instruction-level: 5 GENERAL EDUCATIONAL DEVELOPMENT (GED) TEST, HIGH SCHOOL DIPLOMA, AND SOME COLLEGE OR HIGHER.
1 PAY GRADE WE USE PAY GRADE AT DIVISION TO CATCH EACH ASSISTANCE PART'S PRESENTATION DURING DEPLOYMENT-READY HELP. Because of the sparsity of cells, some compensation grade classifications are accumulated into more significant canisters. 5 ANNOUNCED COMPENSATION GRADE RECEPTACLES INCLUDE E1, E2, E3, E4, E5, E6, AND E7-E9, WITH E1 BEING THE COMPENSATION GRADE FOR PRIVATES AND E7-E9 BEING THE COMPENSATION GRADES FOR SENIOR NON-APPOINTED OFFICIALS LONG STRETCHES OF SERVICE WE UTILIZE THREE CONTAINERS TO CATCH THE DISPERSION OF RESIDENCY FOR DEPLOYMENT-READY HELP AT A YEAR OF PARTITION: 0-5, 6-19, AND 20+ YEARS. 1 NOTE THAT MOST ENROLLED ADMINISTRATION INDIVIDUALS SERVE UNDER FIVE YEARS AND VOCATION
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a677… 5/12
WORKFORCE ARE QUALIFIED FOR RETIREMENT AT 20 YEARS OF ADMINISTRATION.
MILITARY OCCUPATION OCCUPATION FOR ENROLLED STAFF INSIDE THE ARMY IS CHARACTERIZED BY A MILITARY OCCUPATION SPECIALTY (MOS) CODE. MOS CODE UTILIZATION FLUCTUATES AFTER SOME TIME AS NEW OCCUPATIONS ARE MADE, AND OLD ONES ARE DISPOSED OF OR REARRANGED. TO REPRESENT THESE CHANGES, WE TOTAL MOS OCCUPATION CODES TO THE DEPARTMENT OF DEFENSE'S MILITARY OCCUPATIONAL SPECIALTY CLASSIFICATION CODES AT THE 2-AND 3-DIGIT LEVELS.
EMPLOYER GEOGRAPHY EMPLOYMENT AND INCOME OUTCOMES ARE ACCESSIBLE FOR EVERY ONE OF THE 50 STATES AND THE DISTRICT OF COLUMBIA. A SPECIALIST IS RELEGATED TO A GIVEN STATE IF THEIR PREVAILING BOSS FOR THE SCHEDULE YEAR PAID UI TO PAY FOR THAT LABOURER IN THAT STATE. For bureaucratic workers, we utilize the area of the administration office to build up boss topography. 1 STATES ARE DISTINGUISHED BY THEIR FEDERAL INFORMATION PROCESSING STANDARD (FIPS) STATE CODE.
References
6 HOFMANN, M., & KLINKENBERG, R. (Eds.). (2016). RapidMiner: 7 DATA MINING USE CASES AND BUSINESS ANALYTICS APPLICATIONS. CRC Press.
8 KOTU, V., & DESHPANDE, B. (2014). 7 PREDICTIVE ANALYTICS AND DATA MINING: CONCEPTS AND PRACTICE WITH RAPIDMINER. Morgan Kaufmann.
9 US CENSUS BUREAU, C. (2010, January 01). 10 US CENSUS BUREAU CENTER FOR ECONOMIC STUDIES PUBLICATIONS AND REPORTS PAGE. 11 RETRIEVED JULY 24, 2020, FROM HTTPS://LEHD.CES.CENSUS.GOV/DATA/VEO_EXPERIMENTAL.HTML
Citations (11/11) 1 Another student's paper
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a677… 6/12
Matched Text
2 Another student's paper
3 https://www.census.gov/newsroom/tip-sheets/2020/tp20-10.html
4 https://allprogramminghelp.com/rapidminer-homework-help
5 https://lehd.ces.census.gov/data/veo_experimental.html
6 Another student's paper
7 https://jp.b-ok.org/terms/?q=rapidminer
8 Another student's paper
9 Another student's paper
10 Another student's paper
11 https://lehd.ces.census.gov/data/
Suspected Entry: 68% match
Uploaded - BusinessIntelligence.docx
USING RAPIDMINER IN BUSINESS INTELLIGENCE
Source - Another student's paper USING MINITAB IN BUSINESS INTELLIGENCE
Suspected Entry: 100% match
Uploaded - BusinessIntelligence.docx
HISTORY OF THE RAPIDMINER
Source - Another student's paper History of the RapidMiner
Suspected Entry: 100% match
Uploaded - BusinessIntelligence.docx
REVIEW OF THE DATA
Source - Another student's paper Review of the Data
Suspected Entry: 80% match
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a677… 7/12
Uploaded - BusinessIntelligence.docx
VETERAN EMPLOYMENT OUTCOMES (VEO) IS TRIAL CLASSIFICATIONS CREATED BY THE LONGITUDINAL EMPLOYER-HOUSEHOLD DYNAMICS (LEHD) PROGRAM IN A JOINT EFFORT WITH THE U.S
Source - Another student's paper Veteran Employment Outcomes (VEO) is preliminary arrangements made by the Longitudinal Employer- Household Dynamics (LEHD) program in a joint exertion with the U.S
Suspected Entry: 66% match
Uploaded - BusinessIntelligence.docx
THE VEO GIVE DATA ON INCOME AND EMPLOYMENT FOR AS OF LATE RELEASED ARMY VETERANS
Source - Another student's paper The VEO give information on pay and employment for starting late discharged Army veterans
Suspected Entry: 65% match
Uploaded - BusinessIntelligence.docx
PROFIT IS ACCESSIBLE AT THE 25TH, 50TH, AND 75TH PERCENTILES, ONE, FIVE, AND TEN YEARS AFTER DETACHMENT FROM DEPLOYMENT- READY ASSISTANCE, BY RANK, OCCUPATION, AND RELEASE PARTNER
Source - Another student's paper The benefit is available at the 25th, 50th, and 75th percentiles, one, five, and ten years after separation from arrangement prepared help, by rank, occupation, and discharge accomplice
Suspected Entry: 100% match
Uploaded - BusinessIntelligence.docx
EXPLORING THE DATA WITH THE TOOL
Source - Another student's paper Exploring the Data with the tool
Suspected Entry: 99% match
Uploaded - BusinessIntelligence.docx
CLASSIFICATIONS ALTERNATIVE TECHNIQUES
Source - Another student's paper Classifications Alternative Techniques
Suspected Entry: 100% match
Uploaded - BusinessIntelligence.docx
OUTLINE OF RESULTS
Source - Another student's paper Outline of Results
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a677… 8/12
Suspected Entry: 71% match
Uploaded - BusinessIntelligence.docx
PAY GRADE WE USE PAY GRADE AT DIVISION TO CATCH EACH ASSISTANCE PART'S PRESENTATION DURING DEPLOYMENT-READY HELP
Source - Another student's paper Pay Grade We use pay grade at division to get every help part's introduction during organization prepared assistance
Suspected Entry: 66% match
Uploaded - BusinessIntelligence.docx
NOTE THAT MOST ENROLLED ADMINISTRATION INDIVIDUALS SERVE UNDER FIVE YEARS AND VOCATION WORKFORCE ARE QUALIFIED FOR RETIREMENT AT 20 YEARS OF ADMINISTRATION
Source - Another student's paper Note that most enlisted organization people serve under five years, and business workforce is equipped for retirement at 20 years of the organization
Suspected Entry: 85% match
Uploaded - BusinessIntelligence.docx
MILITARY OCCUPATION OCCUPATION FOR ENROLLED STAFF INSIDE THE ARMY IS CHARACTERIZED BY A MILITARY OCCUPATION SPECIALTY (MOS) CODE
Source - Another student's paper Military Occupation Occupation for enlisted staff inside the Army is described by a Military Occupation Specialty (MOS) code
Suspected Entry: 72% match
Uploaded - BusinessIntelligence.docx
MOS CODE UTILIZATION FLUCTUATES AFTER SOME TIME AS NEW OCCUPATIONS ARE MADE, AND OLD ONES ARE DISPOSED OF OR REARRANGED
Source - Another student's paper MOS code use varies after some time as new occupations are made, and old ones are discarded or improved
Suspected Entry: 90% match
Uploaded - BusinessIntelligence.docx
TO REPRESENT THESE CHANGES, WE TOTAL MOS OCCUPATION CODES TO THE DEPARTMENT OF DEFENSE'S MILITARY
Source - Another student's paper To speak to these changes, we absolute MOS occupation codes to the Department of Defense's
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a677… 9/12
OCCUPATIONAL SPECIALTY CLASSIFICATION CODES AT THE 2-AND 3-DIGIT LEVELS
Military Occupational Specialty Classification codes at the 2-and 3-digit levels
Suspected Entry: 70% match
Uploaded - BusinessIntelligence.docx
EMPLOYER GEOGRAPHY EMPLOYMENT AND INCOME OUTCOMES ARE ACCESSIBLE FOR EVERY ONE OF THE 50 STATES AND THE DISTRICT OF COLUMBIA
Source - Another student's paper Business Geography Employment and salary outcomes are open for all of the 50 states and the District of Columbia
Suspected Entry: 65% match
Uploaded - BusinessIntelligence.docx
A SPECIALIST IS RELEGATED TO A GIVEN STATE IF THEIR PREVAILING BOSS FOR THE SCHEDULE YEAR PAID UI TO PAY FOR THAT LABOURER IN THAT STATE
Source - Another student's paper A master is consigned to a given state if their overall manager for the timetable year paid UI to pay for that work in that state
Suspected Entry: 88% match
Uploaded - BusinessIntelligence.docx
STATES ARE DISTINGUISHED BY THEIR FEDERAL INFORMATION PROCESSING STANDARD (FIPS) STATE CODE
Source - Another student's paper States are recognized by their Federal Information Processing Standard (FIPS) state code
Suspected Entry: 63% match
Uploaded - BusinessIntelligence.docx
KRISHNA SAI RAVILLA
Source - Another student's paper Sai Krishna Rachakonda
Suspected Entry: 65% match
Uploaded - BusinessIntelligence.docx
07/24/2020
Source - https://www.census.gov/newsroom/tip- sheets/2020/tp20-10.html April 24, 2020
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a67… 10/12
Suspected Entry: 62% match
Uploaded - BusinessIntelligence.docx
MACHINE LEARNING, PRESCIENT ANALYTICS, TEXT MINING, BUSINESS ANALYTICS AND DATA MINING ARE REFERRED IN TO AS A RAPID MINER
Source - https://allprogramminghelp.com/rapidminer- homework-help RapidMiner Studio is a "downloadable GUI for machine learning, data mining, text mining, predictive analytics, and business analytics."
Suspected Entry: 86% match
Uploaded - BusinessIntelligence.docx
VETERAN EMPLOYMENT OUTCOMES (VEO) ARE NEW TRIAL U.S
Source - https://lehd.ces.census.gov/data/veo_experimental.h tml Veteran Employment Outcomes (VEO) are new experimental U.S
Suspected Entry: 100% match
Uploaded - BusinessIntelligence.docx
GENERAL EDUCATIONAL DEVELOPMENT (GED) TEST, HIGH SCHOOL DIPLOMA, AND SOME COLLEGE OR HIGHER
Source - https://lehd.ces.census.gov/data/veo_experimental.h tml General Educational Development (GED) Test, High School Diploma, and Some College or Higher
Suspected Entry: 64% match
Uploaded - BusinessIntelligence.docx
ANNOUNCED COMPENSATION GRADE RECEPTACLES INCLUDE E1, E2, E3, E4, E5, E6, AND E7-E9, WITH E1 BEING THE COMPENSATION GRADE FOR PRIVATES AND E7- E9 BEING THE COMPENSATION GRADES FOR SENIOR NON-APPOINTED OFFICIALS LONG STRETCHES OF SERVICE WE UTILIZE THREE CONTAINERS TO CATCH THE DISPERSION OF RESIDENCY FOR DEPLOYMENT-READY HELP AT A YEAR OF PARTITION
Source - https://lehd.ces.census.gov/data/veo_experimental.h tml E1, E2, E3, E4, E5, E6, and E7-E9, with E1 being the pay grade for Privates and E7-E9 being the pay grades for senior non-commissioned officers (i.e
Suspected Entry: 100% match
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a67… 11/12
Uploaded - BusinessIntelligence.docx
0-5, 6-19, AND 20+ YEARS
Source - https://lehd.ces.census.gov/data/veo_experimental.h tml 0-5, 6-19, and 20+ years
Suspected Entry: 100% match
Uploaded - BusinessIntelligence.docx
HOFMANN, M., & KLINKENBERG, R
Source - Another student's paper Hofmann, M., & Klinkenberg, R
Suspected Entry: 99% match
Uploaded - BusinessIntelligence.docx
DATA MINING USE CASES AND BUSINESS ANALYTICS APPLICATIONS
Source - https://jp.b-ok.org/terms/?q=rapidminer Data Mining Use Cases and Business Analytics Applications
Suspected Entry: 99% match
Uploaded - BusinessIntelligence.docx
PREDICTIVE ANALYTICS AND DATA MINING
Source - https://jp.b-ok.org/terms/?q=rapidminer Predictive Analytics and Data Mining
Suspected Entry: 100% match
Uploaded - BusinessIntelligence.docx
CONCEPTS AND PRACTICE WITH RAPIDMINER
Source - https://jp.b-ok.org/terms/?q=rapidminer Concepts and Practice with RapidMiner
Suspected Entry: 100% match
Uploaded - BusinessIntelligence.docx
KOTU, V., & DESHPANDE, B
Source - Another student's paper Kotu, V., & Deshpande, B
Suspected Entry: 75% match
Uploaded - BusinessIntelligence.docx
US CENSUS BUREAU, C
Source - Another student's paper US Census Bureau Center
7/26/2020 SafeAssign Originality Report
https://ucumberlands.blackboard.com/webapps/mdb-sa-BB5a31b16bb2c48/originalityReportPrint?course_id=_116149_1&paperId=3169682491&&attemptId=a67… 12/12
Suspected Entry: 85% match
Uploaded - BusinessIntelligence.docx
US CENSUS BUREAU CENTER FOR ECONOMIC STUDIES PUBLICATIONS AND REPORTS PAGE
Source - Another student's paper “US Census Bureau Center for Economic Studies Publications and Reports Page.” Data, 1 Jan
Suspected Entry: 73% match
Uploaded - BusinessIntelligence.docx
RETRIEVED JULY 24, 2020, FROM HTTPS://LEHD.CES.CENSUS.GOV/DATA/VEO_EX PERIMENTAL.HTML
Source - https://lehd.ces.census.gov/data/ lehd.ces.census.gov/data/veo_experimental.html