Big Data requires the storage,
organization, and processing of data at a scale and efficiency that go
well beyond the capabilities of conventional information
technologies. In this course, we will study the state of the art in
big data management: we will learn about algorithms, techniques and
tools needed to support big data processing. In addition, we will
examine real applications that require massive data analysis and how
they can be implemented on Big Data platforms.
The course will consist of lectures based both on textbook material
and scientific papers. It will also include programming assignments
and a final project that will provide students with hands-on
experience on building data-intensive applications using existing Big
Data platforms, including Amazon AWS. Besides lectures given by the
instructor, we will also have guest lectures by experts in some of the
topics we will cover.
For programming assignments, the instructor and graders will run your code and your grade will depend on the correctness of the outputs. Therefore, you must strictly follow the guidelines given for the programming assignments to ensure we will be able to run them.
Programming assignments must be done individually, unless otherwise noted. Students must design and program their own solutions -- copying from other students or any other source is not acceptable.Students are required to follow the following rules about academic honesty: http://www.cs.nyu.edu/web/Academic/Graduate/academic_dishonesty.html
Lateness policy: Late quizzes, assignments, or projects will not be accepted without a note from your physician or from your employer.
To register use the class token provided in NYU ClassesMake sure to use your official NYU email (with your netid -- your initials and a number) when you register. External email addresses are not allowed.
The quizzes appear to be sets of mutiple-choice questions. But you should think of the questions as if you were asked to work an ordinary, "long-answer" question. Work that question and keep the answer handy on a piece of paper. The multiple-choice question will typically sample your knowledge of the correct answer. You can try the work as many times as you like, and we hope everyone will eventually get 100%. Also notice that you have to wait 10 minutes between openings, so brute-force random guessing will not work. You will be graded on your last submission .