AI News, Pig Advantages and Disadvantages
- On Wednesday, June 6, 2018
- By Read More
Pig Advantages and Disadvantages
Apache Pig is a dataflow language that is built on top of Hadoop to make it
easier to process, clean and analyze 'big data' without having to write vanilla
Good old joins, distinct, union and many more commands are already in the language.
So what exactly Pig solves different than relational database is its applicability to 'big data' where it can crunch large files with ease and it does not need a structured data.
Data analysis matters because as original paper very good puts it: Data analysis is 'inner loop' of product innovation.
(So much for the analogy) Pig paper also introduces the basic motivation for Pig why it is useful and
as you read the paper you realize that the processing pipeline is actually Directed Acyclic Graph and paper goes a little more in depth in
- On Sunday, March 24, 2019
Ellen's Helping Out with Homework!
Is there anything she can't do? Ellen offered to help her viewers with their homework. This is how it turned out!
Lecture 3 | GloVe: Global Vectors for Word Representation
Lecture 3 introduces the GloVe model for training word vectors. Then it extends our discussion of word vectors (interchangeably called word embeddings) by ...
II Order my new book, Note To Self, here |
Enhancing the United State of Women Through Data Science
In conjunction with the United State of Women Summit, NASA hosted “Engaging Women & Girls in STEM Through Data Science,” at the agency's headquarters ...