Create M/R software. Look at /res/prenoms.csv on HDFS (hdfs dfs -cat). It contains semicolon-separated CSV exports of first names. The header (column names) is name;gender (m/f);origin;version (ignore the version value). When multiple values are possible (gender, origin), they are comma-separated.

Create M/R programs to get these stats (1 M/R job per stat) (16 pts):
- Count of first names by origin
- Count of first names by number of origins (how many first names have x origins, for x = 1, 2, 3...)
- Proportion (in %) of male or female
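As a rough illustration of the three jobs, here is a minimal in-process sketch in Python. It only simulates the map, shuffle/sort and reduce phases locally; it is not the repository's implementation and does not use the Hadoop API. The helper names (`parse`, `run_job`, `map_origin`, `map_origin_count`, `map_gender`, `gender_percent`), the sample rows, and the `_total` key are all hypothetical. It assumes the header row starts with `name`, and that a first name listed as `m,f` counts toward both genders, with percentages taken over the number of first names (so they can add up to more than 100).

```python
from itertools import groupby


def parse(line):
    """Split one row into (name, genders, origins); None for header or short rows."""
    fields = [f.strip() for f in line.strip().split(";")]
    # Assumption: the header row starts with the literal column name "name".
    if len(fields) < 3 or fields[0].lower() == "name":
        return None
    genders = [g.strip() for g in fields[1].split(",") if g.strip()]
    origins = [o.strip() for o in fields[2].split(",") if o.strip()]
    return fields[0], genders, origins


def map_origin(record):
    """Stat 1: emit (origin, 1) for every origin of a first name."""
    for origin in record[2]:
        yield origin, 1


def map_origin_count(record):
    """Stat 2: emit (number of origins, 1) once per first name."""
    yield len(record[2]), 1


def map_gender(record):
    """Stat 3: emit (gender, 1) per gender, plus a hypothetical total counter."""
    for gender in record[1]:
        yield gender, 1
    yield "_total", 1


def run_job(lines, mapper):
    """Simulate map, shuffle/sort and a summing reduce in one process."""
    records = filter(None, map(parse, lines))
    pairs = sorted(kv for rec in records for kv in mapper(rec))
    return {key: sum(value for _, value in group)
            for key, group in groupby(pairs, key=lambda kv: kv[0])}


def gender_percent(lines):
    """Turn the gender counts into percentages of all first names."""
    counts = run_job(lines, map_gender)
    total = counts.pop("_total", 0)
    return {g: 100.0 * n / total for g, n in counts.items()} if total else {}


# Made-up sample rows in the described format, for illustration only.
sample = [
    "name;gender;origin;version",
    "aaron;m;english, hebrew;1.0",
    "alex;m,f;english;1.0",
    "marie;f;french;1.0",
]

by_origin = run_job(sample, map_origin)
by_origin_count = run_job(sample, map_origin_count)
percent = gender_percent(sample)
print(by_origin, by_origin_count, percent)
```

On a real cluster each mapper and the summing reducer would be a separate job (for example Java `Mapper`/`Reducer` classes or Hadoop Streaming scripts); the percentage step is a small post-processing pass over the reducer output.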
redcinelli/TP2_introHadoop