title: CSCI-362-02 modified: 2021-08-25 tags: teaching, math slug: CSCI362-02 label: CSCI362-02 authors: Evan Misshula summary: Syllabus
|Class Number||CSCI 362-02|
|Databases and Data Mining|
There were 5 Exabytes of information created between the dawn of civilization through 2003, but that much information is now created every 2 days.
~Eric Schmidt, Executive Chairman, Google
There has been a revolution in machine learning and data science in the last ten years. This class will introduce you to the major models and mathematics of of effective learning models to make it easier for you to learn and use these languages to write elegant, fast and maintainable code.
The class will be a success if some of these tools make the rest of your career (research, government or industrial) more productive and enjoyable.The syllabus will be updated as the course progresses. The schedule is not currently set and will be starting to be updated soon. You are welcome (and encouraged) to make your own recordings of the lectures.
Location and Time
- Where: Rm: 6.64.02, New Building, CUNY John Jay
- When: Mondays and Wednesdays 5.55 -7.10
Computers in the classroom
This is a hands on class. You will be in front of a computer during class throughout the semester. Please feel free to use the computer to take notes or use the class Etherpad to discuss the course material, but do not use the computers for non-class related topics.
What is the goal of the class?
Introduce you to both databases such as mysql and postgres and data mining including Natural Languaage Processing, Logistic Regression, K-nearest neighbors, support vector machines, supervised and unsupervised learning, reinforcement learning, neural networks and causal inference.
If the class is successful students will be able to use general principles for data representation & manipulation. They will write efficient, maintainable and modular code.
If the class is successful students will be able to create abstract data types. They will understand the mathematics of both additive and product types.
If the class is successful students will be able to use techniques to analyze the advantages and disadvantages of a programming paradigm for a given problem.
Students will become familiar with tools for contributing to open source projects and open source resources for learning new languages, joining new programming communities and obtaining employment in emerging languages.
Upon completing the course, you should be able to:
- Connect to the Discord channel for your language, construct a minimal example and ask an appropriate question.
- Write Haskell programs using recursion, higher order functions and user defined types.
- Write a Python library
- Understand the scope rules, type system, parameter passing and data abstraction of a language.
- Describe the fudamentals of compilers and interpreters
Most of these are drawn from the ACM’s CS Curriculum 2013
What will be covered?
All of the technologies have been chosen to be readily available and likely to be around for many years to come. You will gain experience with the following (do not worry if you have not heard of some or all of these). Because this is a one semester class, we will not be able to go into depth on these topics, but this should provide the foundation for you to learn more when you need or want to.
Note: this is not the order that we will cover the material and we might not get to all of these in class.
- Databases CRUD operations
- Databases Configurartion
- Natural Languaage Processing
- Logistic Regression
- K-nearest neighbors
- decision trees
- random forests
- support vector machines
- supervised and unsupervised learning
- reinforcement learning
- neural networks
- causal inference.
There will a few programming assignments. The problems will be difficult. There will be frequent quizzes at the beginning and end of class. If there is no test your grade for the day will be based on your participation and preparation will determine your grade for the day. Successful completion of all homework should guarantee a grade of ‘A’. In addition there will a midterm and final exam in case students are struggling. Be warned, this is a demanding class.
The general rule of thumb regarding college studying is, and has been for a long time, that for each class, students should spend approximately 2-3 of study time for each hour that they spend in class. Many students carry a course load of 15 credits, or approximately 15 hours of class time each week.
If students are spending considerablely more than 7.5 hours per week on the homework, they are advised to speak with the instructor and adjustments will be made. If you put int the time, there is no reason not to get an outstanding mark. Do not try to game the system by not doing the homework and making it up on the exams. That strategy has not been successful yet.
Advanced Data Structures & Programming Languages (CSCI 373 & CSCI 374)
Tuesday & Thursday 1 pm to 5 pm Jennifer Holst
Students who find that they need assistance at other times and days should try to set up specific appointments by contacting:
Freddy Velez email@example.com or Jennifer Holst Jennifer.firstname.lastname@example.org
Students enrolled in this course are required to attend all lecture, recitation and laboratory sessions of the section for which they registered. (During summer session, two weeks of classes are covered each calendar week.) Excessive absences (defined above) will result in a reduction in the grade. Attendance is taken solely from roll sheets circulated at the beginning and/or end of each session. Lateness or early departure (resulting in missing no more than 15 minutes of a session) counts as 1⁄2 absence. Students missing more than 15 minutes of a session will be counted as absent. If the college is officially closed, thereby canceling all classes, an announcement will be found on 237-8000, and broadcast on AM stations WINS (1010), WOR (710), WCBS (880), WADD (1280), WMCA (570), WLIB (1190), and WFAS (1230), as well as FM stations WCBS (101.1) and WBLS (107.5). If a class will be cancelled for extraordinary circumstances, the instructor will email an announcement to the preferred email of enrolled students as soon as practicable. This has not happened in recent memory.
ACTIVE COLLEGE E-MAIL
Students are expected to maintain active and accessible college email and Blackboard accounts. Blackboard will be used to send emails and may be used to post announcements, handouts, additional study materials, text supplements, grades, etc. Use the CUNY Portal Login page help features for a forgotten username or password, or contact DoIT, 212-237-8200 for other help. Verify your CUNY email address is correctly listed on Blackboard and keep the mailbox from filling up and refusing delivery, because you will be responsible for the contents of any email sent to that account.
When emailing instructors for this course, start the email’s subject line with the course and section number (e.g., CSCI 372-01) followed by a brief description. Include your full name in the body of every email. Emails that do not contain these descriptive details may be considered spam, and remain unopened and unanswered. Students are expected to check email regularly.
ADA STATEMENT: STUDENTS WITH DISABILITIES
Qualified students with disabilities will be provided reasonable academic accommodations if determined appropriate by the Office of Accessibility Services (OAS), 212- 237-8031, located in room L.66.00. Prior to granting disability accommodations, verification of a student’s eligibility must be timely received from OAS by the math department chairman, Professor Douglas Salane (email@example.com), and the instructor, from the OAS. It is the student’s responsibility to initiate contact with the OAS and to follow the established procedures for having the accommodation notice sent to both the course coordinator and the instructor.
Students who succeed in this course and graduate with a degree in Computer Science and Information Security may be hired by government or private agencies to analyze evidence and testify in a court of law, placing in jeopardy another person’s reputation and/or liberty. Dishonesty of any kind cannot and will not be tolerated. Students are expected to become thoroughly aware of the “John Jay College Policy on Academic Integrity” (and other college policies), available on the college’s Web site. Sanctions to the extent permitted by the policy will be imposed and any written material submitted may be transmitted by the instructor to Turnitin.com (or equivalent service) to help analyze its originality. See the Undergraduate Bulletin for the College’s Policy on Plagiarism and Cheating, which will be strictly enforced. Plagiarism includes copying ASA or homework answers from others. You are required to do your own work to avoid severe grade and disciplinary penalties. The College subscribes to Turnitin.com and Blackboard has a similar module called SafeAssign. Any written assignments submitted may be subject to evaluation by these or similar programs.