CSE3330
DATABASE SYSTEMS - I
CSE3330 DB I, Spring 2014
Lecture 1: Introduction
Department of Computer Science and Engineering, University of Texas at Arlington
©Chengkai Li, 2014
Self Introduction

Chengkai Li
http://ranger.uta.edu/~cli

Research interests:
big data management and mining

Courses that I teach:

CSE3330 DBI (Spring14,13,12,11)

CSE4334/5334 Data Mining (Fall13,12,11,10,09,08)

CSE6339



Crowdsourcing, Knowledge Graph, Computational Journalism (Spring14)

Graph Data Management and Mining (Spring13)

Computational Journalism (Spring12)

Web Search, Mining, and Integration (Spring11,10,09,08)
CSE3302 Programming Languages (Spring08, Fall07)
Looking for students: REU (Research Experiences for Undergraduates) Scholarship
Background Check

Prerequisite:
CSE 2320 ALGORITHMS & DATA STRUCTURES
or
CSE 2321 DATA STRUCTURES FOR NON-ENGINEERS
Course Page


http://crystal.uta.edu/~cli/cse3330
 Syllabus, Schedule (lecture notes).
Course announcements will be made at BlackBoard.
http://www.uta.edu/blackboard/
Basics




Lectures: Tue/Thu 3:30-4:50pm, ERB130
Office Hours: Friday 2:00-4:00pm, ERB628
Contact: cli [at] uta [dot] edu, (817) 272-0162
TA: TBA
Textbook

Required Textbook:
Ramez Elmasri and Shamkant Navathe. Fundamentals of Database Systems (6th
Edition), Addison-Wesley Publishers, April 2010. ISBN 0136086209.

Reference Textbook:

Abraham Silberschatz, Henry Korth, and S. Sudarshan, Database System
Concepts, McGraw-Hill Publishers, 2010. ISBN 0073523321.

Hector Garcia-Molina, Jeffrey D. Ullman and Jennifer Widom, Database
Systems: The Complete Book (2nd Edition), Prentice Hall. 2008. ISBN
0131873253.

Raghu Ramakrishnan and Johannes Gehrke, Database Management Systems
(3rd Edition), McGraw-Hill Publishers, 2002. ISBN 0072465638.
Disclaimer: the slides

The slides highlight the gist of the most important concepts and techniques.

But

They are not meant to be complete. Details may not be included.
They may be simplified for ease of explanation in limited time and space.
You may not do well in the course if you just read the slides.
 You need to read the book and study the slides carefully.
Tentative Grading Scheme

Midterm: 20%

Final: 30%

Homework (HW): 20%
(Must be done independently)

Course Project: 30%
(Must be done independently)

Final Letter Grade:
 No pre-defined cutoffs. Will be based on the curve of students’
performance.
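
As a rough illustration of how the tentative weights combine, here is a minimal sketch. It assumes the weights read as Midterm 20%, Final 30%, Homework 20%, and Course Project 30%, and that every component is scored on a 0-100 scale; the function and dictionary names are hypothetical. It computes only a weighted raw total, since the letter grade is curved and has no pre-defined cutoffs.

```python
# Tentative weights in percent (assumed reading of the grading slide).
WEIGHTS = {"midterm": 20, "final": 30, "homework": 20, "project": 30}


def weighted_total(scores):
    """Return the weighted raw total (0-100) from per-component scores (0-100)."""
    if sum(WEIGHTS.values()) != 100:
        raise ValueError("weights must sum to 100")
    # Each component contributes its score scaled by its percentage weight.
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS) / 100


example = {"midterm": 80, "final": 90, "homework": 100, "project": 70}
print(weighted_total(example))
```

The result is only the raw weighted score; it does not map to a letter grade.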
Homework (HW)



Problem solving
Focus on most important topics
HW1, HW2, HW3, HW4: 5% each
Projects (P1-P3)

3 Programming Assignments, 10% each
 More hands-on experience
 Mostly implementation
Exams
Midterm:
Thursday, March 6th, 3:30pm-4:50pm, ERB130

Final: (comprehensive, covers the whole semester)
Thursday, May 8th, 2:00pm-4:30pm, ERB130

BlackBoard



http://www.uta.edu/blackboard/
Announcement
Student assignment submission (we don’t accept
email submission or hard-copy)
 HW1-HW4
 P1-P3


Grades
Questions, Discussion
Deadlines




Everything will be submitted through BlackBoard.
What if Blackboard experiences technical failure?
 Email your assignment to us
 Also email a screenshot displaying the technical failure
 We will verify with the University about the failure.
Due time: 11:59pm
Late submission: 5-point deduction per hour, until the score reaches 0. (The raw score of
each assignment is 100, so there is no point in submitting it after 20 hours.)
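
The late-submission arithmetic can be sketched as follows. This is a minimal illustration, not the official grading procedure: it assumes that each started hour counts as a full hour (the slide does not say how partial hours are treated), and the function name is hypothetical.

```python
import math


def late_score(raw_score, hours_late, deduction_per_hour=5):
    """Score after the late penalty, never dropping below 0.

    Assumption: a partial hour is rounded up to a full hour.
    """
    if hours_late <= 0:
        return raw_score  # submitted on time, no penalty
    penalty = deduction_per_hour * math.ceil(hours_late)
    return max(0, raw_score - penalty)


print(late_score(100, 2))   # two hours late
print(late_score(100, 20))  # twenty hours late: a perfect raw score is wiped out
```

With a raw score of 100 and 5 points per hour, the penalty reaches 100 at hour 20, which matches the 20-hour remark on the slide.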
Regrading


Regrade requests must be made within 7 days after we post scores in BlackBoard. The TA will
handle them. Requests will not be considered after 7 days.
If you are not satisfied with the result, you have 7 days to request
again. The instructor will handle it, and the decision is final.
Topics








Database System Concepts and Architecture
Entity-Relationship Model
Relational Model
Database Design
Relational Algebra
SQL
Indexing
Overview of Database and Data Mining Research
Schedule

http://crystal.uta.edu/~cli/cse3330
Your Email


Make sure your MavMail works. We will only
contact you by your MavMail.
Check it on a regular basis.
Get bored?

Do you watch YouTube?
http://www.youtube.com/watch?v=gC2ew6qLa8U
http://www.youtube.com/watch?v=463gKcXDVzQ
Don’t do it. It’s not worth it.
We are very serious about this.
Read and sign the statement.
Academic Dishonesty

Cheating on an examination includes:
1. Copying from another's paper, any means of communication with another during
an examination, giving aid to or receiving aid from another during an examination;
2. Using any material during an examination that is unauthorized by the proctor;
3. Taking or attempting to take an examination for another student or allowing
another student to take or attempt to take an examination for oneself;
4. Using, obtaining, or attempting to obtain by any means the whole or any part of
a not-yet-administered examination.


Plagiarism is the unacknowledged incorporation of another's work into work which
the student offers for credit.
Collusion is the unauthorized collaboration with another in preparing work that a
student offers for credit.
University Policy

http://www.uta.edu/conduct/faculty/suspecteddishonesty.php