-
Notifications
You must be signed in to change notification settings - Fork 66
Open
Description
From @kromerma reading first few chapters:
First chapters feedback
- explain locality means disk head moving & also computer-computer moving
- put in "numbers everyone should know"
- move first section to end
- expand on the "Hadoop is good at" list: unruly, huge, connected, dimensional
-
using the wikip data, show things db is good at & move to progressively less and less
- dimensional: with lots of properties am effectively making table on fly
- connected: flights -> airports -> geo -> weather stations -> weather // time
- in "since there have been ten computers" part: Don't use phrase "talk to", say "Access data from"
After E&C Inc. save xmas:
Walk through carefully the geo-flavor example:
- chimpanzee is labelling records to go to same place
- ... turning one record into zero one or many (spam, solo, gift for me & sis)
- (also, unstructured => structured)
- practical example: market basket analysis
Metadata
Metadata
Assignees
Labels
No labels