COMP3425 Data Mining 代写，以前完成的啦，如果有需要写的直接联系我们
COMP3425 Data Mining S1 2018
Maximum marks 100
Weight 20% of the total marks for the course
Maximum of 8 pages and maximum of 2,500 words, in both
cases excluding cover sheet, bibliography and appendices.
A4 margin, at least 11 point type size. Use of typeface, margins
and headings consistent with a professional style.
Submission deadline 5:00pm, Friday, 16 March
Submission mode Electronic, PDF via Wattle
Estimated time 15 hours
Penalty for lateness 100% after the deadline has passed
First posted: 19
Last modified: 19
Questions to: Wattle Discussion Forum
This assignment specification may be updated to reflect clarifications and modifications
after it is first issued.
It is strongly suggested that you start working on the assignment right away. You can submit
as many times as you wish. Only the most recent submission at the due date will be
In this assignment, you are required to submit a single report comprising your answers to
set questions in the form of a PDF file. You may also attach supporting information
(appendices) as one or more separate PDF files. Appendices will not be marked but may be
treated as supporting information to your answers.
This is a single-person assignment and should be completed on your own. Make certain you
use quality information and that you carefully reference all the material that you use. Any
material that you wish to quote must have the source clearly referenced. It is unacceptable
to present any portion of another author’s work as your own. Anyone found doing this, from
whatever source, will be will be penalised in marks and, in addition, CECS procedures for
plagiarism will apply.
The Australian Computer Society Code of Professional Conduct 2014 is expected to be
applied by all Computing Professionals in Australia. It sets out six values but stresses the primacy of the public interest as the overriding value. In 2017, the US Branch of the
Association for Computing Machinery (ACM), recognizing the ubiquity and far-reaching
impact of algorithms in daily lives, issued a Statement on Algorithmic Transparency and
Accountability incorporating seven Principles designed to address potential harmful social
discrimination due to bias. These two documents are provided with this assignment
specification and you must read them closely.
You must also read the paper, Clarke R. (2018) “Guidelines for the Responsible Application of
Data Analytics” Forthcoming, Computer Law & Security Review 34, 3 (Jul-Aug 2018) , that is
provided with this assignment specification and hereafter referred to as the Guidelines.
You are to consider the application of the ACS code of conduct, the 7 US ACM Principles and
Clarke’s Guidelines to the following fictitious precipitation events scenario.
Precipitation Events Scenario (from Clarke R. (2016) “Big Data, Big Risks” Information Systems
Journal 26, 1 (January 2016) 77-90, PrePrint at http://www.rogerclarke.com/EC/BDBR.html)
Historical rainfall data is acquired from many sources, across an extended period,
and across a range of geographical locations. The collectors, some of them
professionals but most of them amateurs, used highly diverse collection methods
and frequencies, with little calibration and few controls. The data is consolidated
into a single collection. A considerable amount of data manipulation is necessary,
including the interpolation of data for empty cells, and the arbitrary disaggregation
of long-period data into the desirable shorter periods. An attempt to conduct a
quality audit against such sources as contemporaneous newspaper reports proves to
be too expensive, and is curtailed.
Analytical techniques are applied to the data. Confident conclusions are reached
about historical fluctuations and long-term trends. Climate-change sceptics point to
the gross inadequacies in the database, and argue that climate-change proponents,
in conducting their crusade, have played fast and loose with scientific principles.
You must answer the following questions, clearly indicating which question you are
answering within your submission. The page lengths given for each question here are for
guidance only; the page lengths for the overall assignment prescribed elsewhere are
Question 1. (1 page) Consider the ACS code of conduct. For each of the six values and their
sub-parts, discuss whether the value was demonstrated in the scenario and to what extent.
If you consider any value largely irrelevant to the scenario, then a very brief reason for this
assessment is sufficient.
Question 2. (1/2 page) Consider the 7 US ACM Principles. Looking closely at Principle 1,
Awareness, discuss how this principle is applied (or not) in the scenario and identify any
“potential harm” that might have ensued. Question 3. (4 pages) One-by-one, consider every numbered guideline in Table 2 of Clarke’s
Guidelines for the responsible application of data analytics and identify, with justification,
whether the guideline is applicable or irrelevant to the scenario. If you argue that the
guideline is applicable in principle, but needs to be adapted to be more obviously applicable,
then propose a suitable re-phrasing of the guideline. For clarity and brevity, you may choose
to present your answer as a table, reusing the structure and numbering of Table 2, but
without repeating the text of table 2.
Question 4. (1 page) (a) Choose any single, numbered guideline (e.g. guideline 3.3) in Table
2 of the Guidelines that you consider to have been disregarded in the scenario, and discuss
how the failure to consider the guideline could have contributed to the response of the
“Climate-change sceptics” to the conclusions drawn from the analytic work of the scenario. (b) In
addition, identify any other potential consequences that could have occurred due to the failure to
consider that same guideline. For this purpose, the consequences you identify are not necessarily
explicit within the scenario description. You might find it helpful to think of this activity as
contributing to a risk assessment process prior to your hypothetical involvement in the analysis work
of the scenario.
An abstract or executive summary is not required. A cover sheet is optional and does not contribute
to the page count, but your report must have a clearly identified title and author on the first page
after any cover sheet. The author must be identified by both name and University U-number.
No particular layout is specified, but you should follow a professional style and use no smaller than
11 point typeface and stay within the maximum specified page count and the maximum specified
word count. Page margins, heading sizes, paragraph breaks and so forth are not specified but a
professional style must be maintained. Text beyond the page limit or word count limit will be treated
as non-existent. Appendices may be used and do not contribute to the page count, but appendices
might be only quickly scanned or used for reference and will not be specifically marked.
You should properly attribute the source documents provided for your assignment (but not this
assignment specification itself) and any other reference materials you choose to use. No particular
referencing style is required. However, you are expected to reference conventionally, conveniently,
and consistently. Your references should be sufficient to unambiguously identify the source, to
describe the nature of the source, and also to retrieve the source in online and (if possible)
traditional publisher formats.
An assessment rubric is provided. The rubric will be used to mark your assignment. You are advised
to use it to supplement your understanding of what is expected for the assignment and to direct
your effort towards the most rewarding parts of the work.
Your assignment submission will be treated confidentially, but it will be available to ANU
staff involved in the course for the purposes of marking. Assessment Rubric
This rubric will be used to mark your assignment. You are advised to use it to supplement your understanding of what is expected for the assignment and to
direct your effort towards the most rewarding parts of the work. Your assignment will be marked out of 100, and marks will be scaled back to contribute to
the defined weighting for assessment of the course.