Instructions: This is an individual assignment. Use Blackboard to submit your answers on the due date (no hard copies please). Late submissions will receive a zero grade.
Experimentation with Classification: Choose a dataset that is well suited for classification. You can use any dataset that you would like to classify. A good number of datasets can be found in the UCI machine learning data repository but feel free to use any dataset that you want. Make sure that you select a dataset that has a class variable. Then use a tool such as R, Weka, or RapidMiner to classify the dataset. The specific requirements for the assignment are as follows:
Choose a dataset that is of interest to you and is well suited for classification
Give a brief description of the dataset
Test at least 3 classification algorithms. There are many algorithms available for R, Weka, RapidMiner, and KNIME.
o A good resource for R can be found at the Data Mining Algorithms in R Wikibook:
https://en.wikibooks.org/wiki/Data_Mining_Algorithms_In_R/Classification
o Also, the caret package in R is a great place to start experimenting with classification methods.
Design an experiment using training and testing (holdout method), cross-validation, or the bootstrap method.
Compare the results of three or more classification methods under the same experimental setup, using one or more of the classification evaluation metrics discussed in class. The metrics that you choose are up to you and can include accuracy, error rate, sensitivity, specificity, precision, recall, and F-measure.
Write a report that describes your experiment and results. The report should be in either ACM or IEEE conference paper format and should include: an introductory section that details the dataset and the objectives of the analysis; a methodology section that explains the approach you are using to mine the dataset, including the algorithms and parameters (e.g. confidence and support) as well as any steps that you had to take to preprocess the data; a results section that shows the results of your analysis and any interesting patterns that you found; and a conclusion section that summarizes your results and discusses the limitations of your approach and any difficulties that you had with your experiment.
o Links to format templates:
o https://www.ieee.org/conferences_events/conferences/publishing/templates.html
o https://www.acm.org/sigs/publications/proceedings-templates
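The experimental setup and evaluation metrics named in the requirements above can be illustrated with a small sketch. The assignment itself suggests R, Weka, RapidMiner, or KNIME, so this Python version is only an illustration using the standard library. The function names `k_fold_indices` and `binary_metrics` are hypothetical helpers invented for this example, and the labels in the usage section are made-up toy data, not results from any real dataset. It assumes a binary class variable with one class treated as "positive".

```python
import random
from typing import Dict, Hashable, List, Sequence, Tuple


def k_fold_indices(n: int, k: int, seed: int = 0) -> List[Tuple[List[int], List[int]]]:
    """Return k (train_indices, test_indices) pairs for k-fold cross-validation.

    Hypothetical helper: every record index in range(n) appears in exactly
    one test fold, and the same seed always yields the same folds, so all
    classifiers can be compared on identical splits.
    """
    if not 2 <= k <= n:
        raise ValueError("k must satisfy 2 <= k <= n")
    indices = list(range(n))
    random.Random(seed).shuffle(indices)       # shuffle once, reproducibly
    folds = [indices[i::k] for i in range(k)]  # k folds of nearly equal size
    splits = []
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        splits.append((train, test))
    return splits


def binary_metrics(
    y_true: Sequence[Hashable], y_pred: Sequence[Hashable], positive: Hashable
) -> Dict[str, float]:
    """Compute common evaluation metrics from true and predicted labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    pairs = list(zip(y_true, y_pred))
    # Confusion-matrix counts for the chosen positive class.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    def ratio(num: float, den: float) -> float:
        # Assumption of this sketch: report 0.0 when a metric is undefined.
        return num / den if den else 0.0

    accuracy = ratio(tp + tn, len(pairs))
    precision = ratio(tp, tp + fp)
    recall = ratio(tp, tp + fn)  # recall is the same quantity as sensitivity
    return {
        "accuracy": accuracy,
        "error_rate": 1.0 - accuracy,
        "sensitivity": recall,
        "specificity": ratio(tn, tn + fp),
        "precision": precision,
        "recall": recall,
        "f_measure": ratio(2 * precision * recall, precision + recall),
    }


if __name__ == "__main__":
    # Toy labels only, standing in for one classifier's predictions.
    y_true = ["yes", "yes", "yes", "no", "no", "no", "no", "yes"]
    y_pred = ["yes", "no", "yes", "no", "yes", "no", "no", "yes"]
    for name, value in binary_metrics(y_true, y_pred, positive="yes").items():
        print(f"{name}: {value:.3f}")
    for train, test in k_fold_indices(len(y_true), k=4, seed=42):
        print("train:", sorted(train), "test:", sorted(test))
```

Reusing the same seed for every classifier keeps the folds identical, which is what makes the comparison between methods fair. For a class variable with more than two values, one common approach is to compute these metrics once per class, treating that class as positive and all others as negative.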