MSA 8040: Database Management for Analytics

The course, Data Management for Analytics, provides an introduction to the database management and data analytics. The course emphasizes the understanding of the fundamentals of database design, ranging from the relational database model to entity relationship modeling, to normalization and to implementation, including both the basic structured query language (SQL) and advanced SQL such as procedure and Trigger. The course also provides an understanding of new developments and trends such as No-SQL (and database on AWS if time allowed). Finally, the course will deliver unstructured data analytics tools, including web scraping using beautifulsoup, selenium and API, textual analysis such as topic modeling and sentiment analysis for analyzing and understanding unstructured data. The course uses a problem-based approach to learning.

MSA 8040: Database Management for Analytics, Fall Semester 2022
Institute for Insight, Robinson College of Business, Georgia State University

Face Coverings

The University System of Georgia (USG) continues to recognize COVID-19 vaccines and boosters offer safe, effective protection and urges all students, faculty, staff and visitors to get vaccinated and/or boosted either on campus or with a local provider. As USG works closely with the Georgia Department of Public Health to prioritize the health and safety of campus communities, the system encourages people to wear masks based on their preference and assessment of personal risk.

For more information about Covid, please visit Coronavirus Resources @GSU!!!

Instructors

  • Houping Xiao (lead instructor; hxiao[at]gsu.edu)
  • Keerthi Vardhan Kondepati (teaching assistant; kkondepati1[at]student.gsu.edu)

Meeting Times and Locations

  • Thursdays 6:00 PM – 8:30 PM (Eastern Time), GSU Buckhead Executive Ed Ctr 1203

Office Hours

  • Houping Office Hours: Thursdays 2:30pm – 4:00 PM (Eastern Time), GSU Buckhead 329 (Please check with TA first)
  • Keerthi Office Hours:
    • Monday (virtual) 6:00pm – 7:30 PM (Eastern Time), https://gsumeetings.webex.com/gsumeetings/j.php?MTID=m52c9c3919b07c971e66521eb68f28f24
    • Wednesday (in-person) 6:00pm – 7:30 PM (Eastern Time), Insight Lab

    Course Goals

    • The students will be confident to solve Database problems in their job interviews
    • The students will be able to build or query Database in their future jobs
    • The students will be capability work effectively as a group leader or member
    • The students will be capability to develop advanced presentation skills in presenting their works
    • The students will be well prepared for MSA 8770 Text Analytics, offered in Fall 2021 and for the sprint projects

    Course Outcomes

    At the completion of this course, students are expected to obtain the following:

    • SLO1: When asked about the concepts in database, students should be able to:
      • SLO1.1: Explain in own words concepts in both SQL and NO-SQL
      • SLO1.2: Differentiate between SQl and NO-SQL
    • SLO2: When asked about the concepts in database, students should be able to:
      • SLO2.1: Build a SQL, NoSQL database in their devices or on AWS
      • SLO2.2: Implement query in both SQL, NO-SQL databases, and database on AWS
    • SLO3: When asked about the data analytics, students should be able to:
      • SLO3.1: Differentiate between structured and unstructured data
      • SLO3.2: Extract useful information for unstructured data using Web Scraping
      • SLO3.3: Differentiate between unsupervised and supervised learning framework, and be able to implement textual analysis including topic model and sentiment analysis for textual data

    Course Texts and Resources

    • MongoDB
    • MySQL
    • Topic Modeling
    • Carlos Coronel and Steven Morris. Database systems: Design, Implementation, & Management. 13th Edition. Cengage Learning. ISBN-13: 978-1337627900
    • Jiawei Han, Micheline Kamber and Jian Pei. Data Mining: Concepts and Techniques. 3rd Edition. Elsevier. ISBN-10: 0123814790.

    Grading

    Percentages of course works in students’ final scores are as follow:

    • Attendance and Class Participation - 15%
    • Mid-term Exam - 25%
    • Project Assignments - 60% (3 * 20%)
    • Final grade
      • A+ [97%, 100%]
      • A [93%, 97%)
      • A- [90%, 93%)
      • B+ [87%, 90%)
      • B [83%, 87%)
      • B- [80%, 83%)
      • C+ [77%, 80%)
      • C [70%, 77%)
      • D [60%, 70%)
      • F [0%, 60%)
Note:
  • The instructor reserves the right to modify the grading scale so as to improve the letter grade if warranted by the circumstances (e.g., unusually high level of difficulty of problem sets).

  • If GSU is closed for any reason on a scheduled class day, you should be prepared to adjust the schedule accordingly. In other words, the material/exam to be covered/taken on the day in which GSU is closed will be covered/taken in the next class.

Topics Schedule

The course syllabus provides a general plan for the course; deviations may be necessary.

Week Date Topic Code Assigned Due
1 08/25 Logistis
Introduction to Database
2 09/01 SQL: Relational Model & Relational Algebra
3 09/08 SQL: ER Model, ER diagrams, Normalization
4 09/15 Introduction to MySQL Server and Workbench, Data types SaleCO| MovieCO Proj1|Data
Functions, Operators and Statement Syntax SQLStatements | SQLSelect
5 09/22 Structured Query Language: MySQL Join| Subqueries
6 09/29 Advanced Structured Query Language: MySQL Procedure & Trigger
7 10/06 Procedure & Trigger & API API| Trigger Proj2 Proj1 (10/09)
8 10/13 Mid-term Exam
9 10/20 Introduction to Unstructed Data analysis
MongoDB
10 10/27 MongoDB: CRUD & Aggregation Operators
11 11/03 HTML & Web Scraping Proj3 Proj2 (11/06)
12 11/10 Topics Modeling
13 11/17 Sentiment Analysis
14 11/24 Thanksgiving Break
15 12/01 Demo of Proj3 (In-Class or Virtual) Proj3 (12/11)

Course and University Policy Statements

Syllabus Change Policy

This syllabus provides a general guideline for the conduct of this course. However, deviations will be necessary. Updates will be given during the semester and posted online through iCollege.

Class Website on iCollege

This website includes a copy of this syllabus (and any subsequent updates or changes), lecture slides, lab notes, readings, and information about the final project. It is your responsibility to check this website frequently for announcements and updates. Copies of class handouts and presentation slides will be posted on the class website before each week’s classes (if not earlier). You may find it helpful to use these to take notes during class.

Make-up Exam Policy

There is one mid-term exam in this course. Date for the exam is already set on the Tentative Course Schedule above. If there is an excusable reason for being unable to be present during the exam dates, please let the instructor know as soon as possible to schedule a make-up exam. The make-up exam if at all possible, will take place before the scheduled exam date. Students with unexcused absences for an exam will earn a 0 on the exam.

Assignments Submission Policy

  • Homework and Projects are usually assigned during Thursday classes
  • Homework and Project Dues are by default SUNDAYS 11:59 pm (EST). You will have around 4 weeks to finish each project.
  • Assignments must be submitted to iCollege online.
  • Students can submit late with the penalty of 25% deduction for every 12 hours late (up to 2 days)
  • After 2 days, no more late submission is allowed

Course Evaluation and Evolution

The evaluation for the mid-term exam and 2 individual projects will be provided in the following week. And, online appointments with the instructor are available if you want to discuss your performance. Your constructive assessment of this course plays an indispensable role in shaping education at Georgia State. Upon completing the course, please take time to fill out the online course evaluation.

Attendance Policies

Students should be expected to attend class if they do not have an excused absence because of illness. We have a process for students seeking excused absences through the Dean of Students Office. Students submit documentation to https://deanofstudents.gsu.edu/student-assistance/professorabsence- notification/ . Professors will then be notified by the Dean of Students of any excused absence without the need to manage medical information individually.

Academic Honesty

Being responsible for your own learning does not mean that you must always work in isolation. However, when working in groups we encourage you to be mindful of how much effort and Learning you are experiencing. Below, we outline our expectations for work in this course. For projects, I encourage students to work together to solve and understand the problems. Nevertheless, each student is responsible for demonstrating he or she has good grasp of the material. Ultimately, each student’s project solution should reflect his or her own learning and be written in the students’ own words. While students may work together to figure out how to solve the problems, each student must run his or her own analyses and turn in their own output. For the Exam, each should work independently, no discussion is allowed. Under no circumstance should a student email his or her project solutions, project reports, and codes to a classmate. Working together (for the project) is for the purpose of collaborating, not copying. “As members of the academic community, students are expected to recognize and uphold standards of intellectual and academic integrity.” As listed on Policy on Academic Honesty.

Plagiarism (Cheating): Talking over your ideas and getting comments on your writing from friends are NOT examples of plagiarism. Taking someone else’s words (published or not) and calling them your own IS plagiarism. Plagiarism has dire consequences, including flunking the paper in question, flunking the course, and university disciplinary action, depending on the circumstances of the office. The simplest way to avoid plagiarism is to document the sources of your information carefully.

Homework: When discussing problems from assigned homework with other students, you may:

  • Discuss the material presented in class or included in the assigned readings needed for solving the problem(s)
  • Assist another student in understanding the statement of the problem (e.g., you may assist a non-native speaker by translating some English phrases unfamiliar to that student)

It is expected that you have independently arrived at solutions that you turn in for problem sets. The following are examples of activities that are PROHIBITED:

  • Sharing solutions or fragments of solutions (via email, discussion groups, social media, whiteboard, handwritten or printed copies, etc.)
  • Posting solutions or fragments of solutions in a location that is accessible to others
  • Using solutions or fragments of solutions provided by other students (including students who had taken the course in the past)
  • Using solutions or solution fragments obtained on the Internet or from solution manuals for textbooks

Project: When discussing laboratory assignments, you may:

  • Discuss the material presented in class or included in assigned readings, documentation, user manual, etc.
  • Assist another student in understanding the statement of the problem (e.g., you may assist a non-native speaker by translating some English phrases unfamiliar to that student)
  • Discuss high-level ideas about how to complete the lab assignment, including problem specification, general strategies for the solution, strategies for debugging and testing code, etc. without examining code written by other students, or sharing code written by you with other students.

It is expected that you have independently arrived at solutions that you turn in for laboratory assignments. The following are examples of activities that are PROHIBITED:

  • Examining, copying of code or code fragments from someone else (including online sources), other than the code that is provided to you by the instructor or included in the reference books.
  • Sharing code or code fragments (via email, discussion groups, social media, whiteboard, handwritten or printed copies, etc.)

If a “friend” asks you to show him/her your code (especially if the request is to receive a copy of your code), you are opening the door wide for a possible charge of academic misconduct for both of you. I have seen friendships crumble when student A innocently supplies a copy of his/her code to student B, who then plagiarizes it, getting both in trouble. Do not be an accessory; truly help a friend by saying no. The best source for help on these assignments is the instructor or the teaching assistant. We are experienced in providing the right kind of information and help.

Exam: It is expected that you have independently arrived at solutions that you turn in for exams. The following are examples of activities that are PROHIBITED:

  • Copying someone else’s solution
  • Using notes, online resources, or other reference materials (unless instructed otherwise)
  • Seeking, obtaining or providing help on an exam via phone, text messaging, email, social media
  • Altering a graded exam for re-grading
  • Getting an advance copy of the examination
  • Facilitating another student to cheat (e.g., by allowing him or her to copy your solution)
  • Having someone else write the exam amount to cheat on an exam.

You need to exercise special care with take-home exams. You should NEVER

  • Share solutions or fragments of solutions (via email, whiteboard, handwritten or printed copies, etc.)
  • Post solutions or fragments of solutions in a location that is accessible to others
  • Use solutions or fragments of solutions provided by other students (including students who had taken the course in the past)
  • Use solutions or solution fragments obtained on the Internet or from solution manuals for textbooks
  • Use material from textbooks, reference books, online resources, or research articles without properly acknowledging and citing the source

! Warning

  • Violation of the Academic Integrity policy will result in an automatic F for the concerning submission.
  • Two violations ⇒ fail grade in the course
  • Have discussions about homework. Every student should submit their own homework with the names of students in the discussion group explicitly mentioned.

Special Needs

Students who wish to request accommodation for a disability may do so by registering with the Office of Disability Services. Students may only be accommodated upon issuance by the Office of Disability Services of a signed Accommodation Plan and are responsible for providing a copy of that plan to instructors of all classes in which accommodations are sought. Students with special needs should then make an appointment with the instructor during the first week of the class to discuss any accommodations that need to be made.

FERPA

In keeping with USG and university policy, this course website will make every effort to maintain the privacy and accuracy of your personal information. Specifically, unless otherwise noted, it will not actively share personal information gathered from the site with anyone except university employees whose responsibilities require access to said records. However, some information collected from the site may be subject to the Georgia Open Records Act. This means that while we do not actively share information, in some cases we may be compelled by law to release information gathered from the site. Also, the site will be managed in compliance with the Family Educational Rights and Privacy Act (FERPA), which prohibits the release of education records without student permission.

Sexual Harassment

In instances of sexual misconduct, the present instructor(s) and teaching assistants, are designated as Responsible Employees who are required to share with administrative officials all reports of sexual misconduct for university review. If you wish to disclose an incident of sexual misconduct confidentially, there are options on campus for you do so. For more information on this policy, please refer to the Sexual Misconduct Policy which is included in the Georgia State University Student Code of Conduct (http://codeofconduct.gsu.edu/).

Basic Needs Statement

Any student who faces challenges securing their food or housing and believes this may affect their performance in the course is urged to contact the Dean of Students for support. Furthermore, please notify the professor if you are comfortable in doing so. This will enable us to provide resources that we may possess. The Embark program at GSU provides resources for students facing homelessness.