Post Type

Los profesores de NYU: John Langford y Yann LeCun abrieron su clase de Big Data, Large Scale Machine Learning.

No es MOOC (léase Coursera), pero es un lujo poder mirar los videos de las clases y seguir los apuntes.

  • Introduction
  • Online methods for linear models
  • Online methods for nonlinear models
  • LBFGS
  • Boosted Decision Trees and stumps
  • Mapreduce/Allreduce
  • Hadoop
  • Parallelization of learning algorithms: OpenMP, CUDA, OpenCL
  • Inverted Indices & Predictive Indexing
  • Feature Hashing
  • Locally-sensitive Hashing & Linear Dimensionality Reduction
  • Nonlinear Dimensionality Reduction
  • Feature Learning
  • Handling Many Classes, class embedding
  • Active Learning
  • Exploration and Learning

This course is for people interested in automatically extracting knowledge from large amounts of data. Students should have some prior knowledge or experience with basic machine learning methods.

You must have taken a machine learning course at the undergraduate or graduate level prior to taking this course, or have industry experience with machine learning.

Required skills:

  • knowledge of basic methods in machine learning such as linear classifiers, logistic regression, K-Means clustering, and principal components analysis.
  • although much of the assignments will use dynamic/scripting programming languages, some proficiency in C programming will be assumed
  • knowledge of basic concepts in probability and statistics: probability distributions and probability density functions, conditional probabilities, marginalization, Bayes’ theorem
  • basic knowledge of linear algebra and multivariate calculus: linear system solving, eigenvalues/eigenvectors, least square minimization, gradient, Jacobian, and Hessian.