Description
Homework #2
Guideline: Please complete the following problems and generate a PDF file. Please submit the PDF file and a separate zip file that contains all source code to Camino. Please refer to HomeworkFormat.pdf for the format of the submitted PDF file.
Problem 1 For the K-means clustering problem, when the binary indicators (responsibilities) π a e fi ed f k=1, 2, , K a d =1, 2, , N, de i e f he cl e ce e π¦ , k=1, 2, , K, ch ha he following objective function J is minimized:
Problem 2 Iris.xls contains 150 data samples of three Iris categories, labeled by outcome values 0, 1, and 2. Each data sample has four attributes: sepal length, sepal width, petal length, and petal width.
Implement the K-means clustering algorithm to group the samples into K=3 clusters. Randomly choose three samples as the initial cluster centers. Calculate the objective function value J as defined in Problem 1 after the assignment step in each iteration. Exit the iterations if the following criterion is met: π½ Iter 1 π½ Iter Ξ΅, where Ξ΅ 10 , and Iter is the iteration number. Plot the objective function value J versus the iteration number Iter. Comment on the result. Attach the code at the end of the homework.
Problem 3 Assume a data sample π± β β comes from one of two classes, πΆ1 and πΆ . Use logistic regression to do classification.
a. Write the math expression of the logistic regression output, and the criterion used for the final classification.
b. How many parameters (weights) need to be calculated/trained in this method?
Problem 4 Assume a data sample π± β β comes from one of πΎ classes, πΆ1, πΆ , , πΆ . Use logistic regression to do classification.
a. Write the math expression of the logistic regression output, and the criterion used for the final classification.
b. How many parameters (weights) need to be calculated/trained in this method?
Reviews
There are no reviews yet.