DSAA 5012, Spring 2021
Course Title: Advanced Database Management for Data Science

 

Time: Tue, Thur  04:30PM - 05:50PM

Instructor:

Lei Chen (send e-mail for questions regarding the class and for arranging individual meetings)

Projects

Project  

Course Description

In this course, the concepts and implementation schemes in advanced database management systems for data science applications will be introduced, such as disk and memory management, advanced access methods, implementation of relational operators, query processing and optimization, transactions and concurrency control. It also introduces emerging database related techniques for data science. 

Topics:

1) Basic principles of relational databases

2) Graph data processing

3) Uncertain data processing

4) Time series data processing

5) Multimedia data processing

Marking Scheme: 

The course work includes assignments and projects, and a final exam.  The marking scheme is as follows.

Course Material:

Textbooks (References)

1.    Database System Concepts, A. Silberschatz, H. Korth, and S. Sudarshan.

2.    Database Management Systems, Raghu Ramakrishnan and Johannes Gehrke.

3.    Data Mining -- Concepts and Techniques by Jiawei Han and Micheline Kamber. Morgan Kaufmann Publishers.

4.   Community Detection and Mining in Social Media, by Lei Tang and Huan Liu. Morgan & Claypool Publishers.

5.  Social Network Data Analytics, by Charu C. Aggarwal, Springer

 

A collection of papers from journals, conference proceedings, and website need to be read.

 

News:

 

Tentative Schedule

 

Week

Date

Lecture

Topic

Chapter

Notes

Worksheet/Answers(will be released after you submit your worksheet

Video

1

Feb. 2

1

Database Management Systems

1

Course Info, N1

-

 

Feb. 4

2

Entity-Relationship (E-R)
Model and Database Design

6

N2

 W2, A2

 

2

Feb. 9

3

N3

W3, A3

 

Feb. 11

4

Relational Model and
Relational Algebra

2, 7

N4

W4, A4

 

3

Feb.  16

5

N5

W5, A5

 

Feb. 18

6

Relational Algebra (Cont.) SQL

2

N5(Cont), N6

W6, A6

 

4

Feb. 23

7

SQL

3, 4, 5

N7

W7, A7

 

Feb. 25

8

N8

W8, A8

 

5

March 2

9

Relational Database Design and Functional Dependency

7

N9

N10

W9, A9

W10, A10

 

March 4

10

 

6

March 9

11

Storage and File Structure

12, 13

N11

W11, A11

 

March 11

12

Indexing

14

N12

W12, A12

 

7

March 13

13

N13

W13, A13

 

March 16

14

N14

W14, A14

 

8

March 18

15

Query Processing

(Query Processing Costs)

15

 

N15

W15, A15

 

March 25

16

N16

W16, A16

 

9

March 30

17

Query Optimization

16

N17

W17, A17

 

April 8

18

N18

W18, A18

 

10

April 13

19

Transactions

17

N19

W19, A19

 

April 15

20

Concurrency Control

18

N20

W20, A20

 

11

April 20

21

N21

W21, A21

 

April 22

22

Database Recovery

19

N22

W22, A22

 

12

April 27

23

Graph  Databases

 

 

 

April 29

24

Graph Databases

 

 

 

 

13

May 4

25

Uncertain Databases

 

 

 

 

May 6

Review

Final Exam Review

-

-