AI News, Data Science Blog

One of my favorite parts of machine learning in Python is that it got the benefit of observing the R community and then emulating the best parts of it.

One thing that is a blessing and a curse in R is that the machine learning algorithms are generally segmented by package.

That is, instead of a single library (or a small set of libraries) that each implement the common algorithms, each algorithm gets its own package.

This is nice because you can find very esoteric, cutting-edge implementations of algorithms, but it can be a pain for day-to-day use, where you might be switching between algorithms.

pandas took the best parts of data munging in R and turned them into a Python package.

And if you're in the market for some super slick, great-looking interactive plots, then try out bokeh.

Python has a fantastic built-in regular expressions library, re, and a built-in string meta-library appropriately called string.
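As a quick illustration of those two built-in modules, here is a minimal sketch. The sample sentence and the variable names are made up for the example; only `re.findall`, `str.maketrans`, `str.translate`, and `string.punctuation` come from the standard library.

```python
import re
import string

# Hypothetical sample text, invented for this example.
text = "Rodeo 2.0 shipped for Windows, OSX, and Linux!"

# re: pull out version-like tokens (digits, a dot, more digits).
versions = re.findall(r"\d+\.\d+", text)

# string: use the punctuation constant to build a table that deletes
# every punctuation character, then lowercase and split into words.
strip_punctuation = str.maketrans("", "", string.punctuation)
words = text.translate(strip_punctuation).lower().split()

print(versions)
print(words)
```

Note that stripping punctuation also removes the dot in "2.0", so in a real cleanup you would probably extract the version first, as done here.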

We released the very first version of Rodeo just over a year ago and released version 2.0 for Windows, OSX, and Linux about a month ago.

Jupyter notebooks provide an interactive environment for programming in Python (and other languages) that focuses on reproducibility and visualization--it even has a plugin for R!

Same concept: write SQL queries against your data frames, get data frames back!
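To make the idea concrete, here is a rough sketch of running SQL against a data frame and getting a data frame back. It is not necessarily the package being described here: it swaps in pandas' own `to_sql` and `read_sql_query` with an in-memory SQLite database from the standard library. The table name `scores` and the sample data are invented for the example.

```python
import sqlite3

import pandas as pd

# Hypothetical data frame, invented for this example.
df = pd.DataFrame({"team": ["a", "b", "a"], "points": [3, 1, 2]})

# Copy the data frame into an in-memory SQLite table named "scores".
conn = sqlite3.connect(":memory:")
df.to_sql("scores", conn, index=False)

# Run an ordinary SQL query; the result comes back as a data frame.
query = """
    SELECT team, SUM(points) AS total
    FROM scores
    GROUP BY team
    ORDER BY team
"""
result = pd.read_sql_query(query, conn)
conn.close()

print(result)
```

The round trip through SQLite costs a copy of the data, so this is best suited to small or medium data frames where the convenience of SQL outweighs the overhead.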

[Part 2] How to Build a Sports Betting Model

One of the most common questions we get at Clear Data Sports is: "How can I build a sports betting model using analytics?" It's a great question, and there is not ...

The Computational Complexity of Machine Learning

In this episode, Professor Michael Kearns from the University of Pennsylvania joins host Kyle Polich to talk about the computational complexity of machine ...

Linnea Gandhi - How Businesses Can Apply Behavioral Science

In this week's episode, our hosts sit down with Linnea Gandhi, managing partner of the boutique consulting firm BehavioralSight and Adjunct Assistant Professor ...


Dataiku: big data made simple. A conversation with Florian Douetteau, CEO of Dataiku. The company is a member of the Bpifrance Excellence network and ...

Machine Learning with Spark, MLLib and D3.js

This talk shares with participants the process of integrating a Machine Learning (ML) system into a Java application ...

[MINI] Automated Feature Engineering

If a CEO wants to know the state of their business, they ask their highest ranking executives. These executives, in turn, should know the state of the business ...

HTML5 Course - 06 - Paragraphs, Line Breaks, and Special Symbols - by Gustavo Guanabara

The HTML5 Course is a project of the site and will build a complete website using the newest technology on the market: HTML5. The instructor ...

[MINI] Feed Forward Neural Networks

In a feed forward neural network, neurons cannot form a cycle. In this episode, we explore how such a network would be able to ...
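The "no cycles" property can be seen in a tiny sketch: each layer's output is fed only to the next layer, never back to an earlier one. Everything below is an illustrative assumption (the sigmoid activation, the weights, and the function names are not taken from the episode), and there is no training step, only the forward pass.

```python
import math


def sigmoid(x):
    """Squash any real number into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def layer(inputs, weights, biases):
    """Compute one layer: a weighted sum per neuron, then the activation."""
    return [
        sigmoid(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def feed_forward(inputs, layers):
    """Pass activations through the layers in order; nothing loops back."""
    activations = inputs
    for weights, biases in layers:
        activations = layer(activations, weights, biases)
    return activations


# Hypothetical network: 2 inputs -> 2 hidden neurons -> 1 output neuron.
network = [
    ([[0.5, -0.5], [1.0, 1.0]], [0.0, -1.0]),
    ([[1.0, -1.0]], [0.0]),
]
output = feed_forward([1.0, 0.0], network)
print(output)
```

Because data only flows forward, the whole network is just a composition of functions, which is why a plain loop over the layers is enough.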