By the end of this course, you will be able to:

  • Why do the relational databases are not always suitable for big data systems that are deployed in big data contexts.
  • Why the python language is a language widely used in the field of processing large amounts of data. This course introduces you to programming with this language, particularly using the library Numpy.
  • What statistical analyzes require big data processing and prediction.

This training provides you with the basic concepts in statistics such as :

  • random variables,
  • differential calculus,
  • convex functions,
  • optimization problems,
  • regression models.

These bases are applied on a classification algorithm on perceptron.