
This article looks at HUE, one of the components of the Hadoop ecosystem. The name is pronounced "Huey" or spelled out letter by letter (H-U-E), and not like a certain well-known Russian word.
HUE (Hadoop User Experience) is a web interface for analyzing data in Hadoop. At least, that is how the project positions itself. HUE is an open source project released under the Apache license. At the time of writing, HUE is owned by Cloudera. It is installed in all of the most popular Hadoop distributions:
- Pivotal HD 3.0
- Apache Bigtop
- HDInsight Hadoop
- MapR
- Hortonworks Hadoop (HDP)
- Cloudera Hadoop (CDH)
Translated literally, the abbreviation HUE means something like "the Hadoop user's experience". And that is exactly what it is. I came to HUE with some Hadoop experience behind me. I had worked with Hadoop both from the console and through CDM (Cloudera Manager). I mostly used the Spark and YARN frameworks, plus a little Oozie. Over time I accumulated a few ideas, or user requirements:
- It would be convenient to start and configure tasks in YARN and to view their logs in one place.
- It would be great to have a file browser with more features than Cloudera's native one (I used the non-enterprise version).
- It would be great to have a task scheduler that is more convenient and more automated than plain Oozie.
That was my user experience, and HUE implements all of it. Let's start with the file browser. It is far more convenient than Cloudera's native one. You can create, copy and move files and directories from one place in HDFS to another. You can change access permissions, upload files from the local machine to HDFS, and open files in some formats (.txt, .seq) without downloading them to the local machine. The task scheduler is essentially an automated Oozie. HUE creates job.properties and workflow.xml for you; in general, all the routine work that had to be done by hand with Oozie is done by HUE. I explain this in more detail below, with a few examples.
HUE is not only a file browser and a task scheduler, however. It is a set of applications that provides access to almost every module of the cluster, as well as a platform for developing applications.
This article covers the following combinations: HUE + Oozie, HUE + YARN, HUE + Spark and HUE + HDFS.
HUE + Oozie
First, a reminder that Oozie has three main task types:
- Workflow: a DAG (directed acyclic graph) of actions. Put simply, it is just a task of some kind (a MapReduce task, a YARN task, a Spark task, an HDFS task and so on).
- Coordinator: a workflow plus a specified start time and frequency.
- Bundle: the highest level of abstraction in Oozie. It is a set of coordinators that are not necessarily related to each other (I have not used bundles, so I have nothing particular to say about them).
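The relationship between the three task types can be sketched as plain data structures. This is only an illustrative model, not Oozie's or HUE's API: the class names, field names and the action names (mkdir, spark_job, email) are hypothetical.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta
from graphlib import TopologicalSorter  # Python 3.9+


@dataclass
class Workflow:
    """A DAG of actions."""
    name: str
    # action -> set of actions that must finish before it starts
    predecessors: dict = field(default_factory=dict)

    def execution_order(self):
        # Raises graphlib.CycleError if the graph is not acyclic.
        return list(TopologicalSorter(self.predecessors).static_order())


@dataclass
class Coordinator:
    """A workflow plus a start time and a frequency."""
    workflow: Workflow
    start: datetime
    frequency: timedelta


@dataclass
class Bundle:
    """A set of coordinators, not necessarily related to each other."""
    coordinators: list


# Hypothetical three-step chain: mkdir -> spark_job -> email
wf = Workflow("demo", {"spark_job": {"mkdir"}, "email": {"spark_job"}})
daily = Coordinator(wf, datetime(2016, 5, 5, 18, 0), timedelta(days=1))
bundle = Bundle([daily])

print(wf.execution_order())  # ['mkdir', 'spark_job', 'email']
```

The point of the model is the nesting: a bundle holds coordinators, a coordinator wraps exactly one workflow, and only the workflow knows about individual actions.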
At the top of the screen there is a Workflows section with two subsections, Dashboard and Editor, each of which in turn has a subsection for every task type.
The Dashboard is for monitoring running and completed tasks.
The Editor is the Oozie task editor.
Dashboard
The Dashboard shows Oozie tasks. Suppose there is a coordinator that launches a workflow (for example, a task that runs on YARN) at 18:00, and it is only noon now. The YARN Resource Manager will show nothing, because it displays only tasks that are actually running, while the Dashboard will show the coordinator in the Running or Prep state. Running means the task has already been executed at least once, for example the coordinator was submitted yesterday and its workflow ran yesterday. Prep means the task has not been executed yet. The picture above shows part of my dashboard; I think everything is clear without further explanation (confidential information is blacked out).
Editor
Let's start with the Workflows section. It lists all the workflows available to the current user. From here you can create, copy, delete, import and export workflows, and share them with other users. Let's walk through creating a new workflow.
The ACTIONS row contains every action a workflow can be built from: Hive script, HiveServer2 script, Pig script, Spark program, Java program, Sqoop 1, MapReduce job, sub-workflow, Shell, Ssh, HDFS Fs, Email, Streaming, DistCp and Kill.
HUE + HDFS
Let's create a simple workflow that creates a directory in HDFS, but make it parameterized.
Here ${Dir} is a variable whose value is the directory in which the new directories will be created.
${Year}, ${Month} and ${Day} are also variables, and their purpose is self-explanatory.
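How such variables resolve into a concrete directory can be illustrated with Python's string.Template, which happens to use the same ${name} syntax. This is only an analogy, not how HUE or Oozie substitutes values internally, and the path layout ${Dir}/${Year}/${Month}/${Day} is an assumption made for the sketch.

```python
from string import Template

# Assumed layout of the directory to create; not taken from the workflow itself.
path_template = Template("${Dir}/${Year}/${Month}/${Day}")

values = {"Dir": "/test", "Year": 2016, "Month": 4, "Day": 5}
resolved = path_template.substitute(values)
print(resolved)  # /test/2016/4/5

# A variable without a value cannot be resolved; substitute() raises KeyError.
try:
    path_template.substitute({"Dir": "/test"})
    missing = None
except KeyError as exc:
    missing = exc.args[0]
print("missing variable:", missing)  # missing variable: Year
```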
Gray fields appear around the action. Other actions can be dropped onto them, which produces a branching workflow with several outputs; an example of such a workflow is shown later. Gear icons appear in the corner of the action and of the stop action. Clicking a gear opens the settings menu. Each type of action has its own settings, but there is also a common set, such as the transitions: which action to go to when the task completes successfully, and which one when it fails.
When the workflow is run, HUE asks for the values of the variables. Now let's create a coordinator based on this workflow.

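Here is the coordinator. The time window is set in the corresponding fields. It is all straightforward; the only peculiarity is that tasks can be scheduled in the past. A coordinator has two kinds of time, nominal and actual. Nominal time is the scheduled time inside the time window; actual time is the real time at which the task runs. It is also worth mentioning that ticking the "Advanced syntax" checkbox lets you set the frequency in crontab format. That is all there is to say about frequency. The workflow parameters are set with EL functions (Expression Language functions). The function given for the Year variable returns 2016 and the one for Day returns 5. Month is slightly more interesting: its function returns the number of the previous month (the beginning of the expression did not fit in the field, but it is the same as for day and year). Dir is a constant. With these settings the coordinator fires once a day for a week, so HDFS ends up with 7 folders under /test/2016/4/. The example is simple and has no practical use as it stands, but a small change makes it useful: for instance, instead of creating a folder, the task could delete the folder containing the logs for the previous day, month or year.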
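The schedule described above can be reproduced with a few lines of date arithmetic. This is a sketch under stated assumptions, not Oozie's EL implementation: the start date (5 May 2016) is inferred from the values 2016, 5 and previous month 4, the end of the time window is treated as exclusive, and the helper names are hypothetical.

```python
from datetime import datetime, timedelta


def nominal_times(start, end, frequency):
    """Yield nominal times in [start, end), one per `frequency` step."""
    current = start
    while current < end:
        yield current
        current += frequency


def target_dir(base, nominal):
    """Build base/year/previous-month/day from a nominal time.

    As in the text, only the month is shifted back; the year and the day
    are taken from the nominal time unchanged.
    """
    prev_month = (nominal.month - 2) % 12 + 1
    return f"{base}/{nominal.year}/{prev_month}/{nominal.day}"


start = datetime(2016, 5, 5)        # assumed start of the time window
end = start + timedelta(days=7)     # one week, end treated as exclusive
dirs = [target_dir("/test", t)
        for t in nominal_times(start, end, timedelta(days=1))]

for d in dirs:
    print(d)
print(len(dirs))  # 7
```

Because the directory is derived from the nominal time rather than from the moment the task actually runs, a coordinator scheduled in the past still produces the same seven paths.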
HUE + YARN
You can build a workflow from actions such as Java program or MapReduce job and parameterize it just as in the HDFS example. Task execution is logged as usual, and the logs shown in HUE are taken from the YARN Resource Manager or the History Server. They are simply structured a little more conveniently (which is, of course, a matter of taste). Another difference from running a task directly in YARN is that slightly more resources are required. An Oozie task is created first, and its only purpose is to launch the workflow action (the Java, MapReduce or Spark task). This Oozie task takes one core (a vcore, not a physical core) and 1.5 GB of RAM from the cluster.
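The extra cost is easy to estimate. The sketch below uses the figures quoted above (1 vcore and 1.5 GB per Oozie task) and assumes, as a simplification, that every concurrently running action has its own such task; actual values depend on the cluster configuration.

```python
# Figures quoted in the text for one Oozie launcher task.
LAUNCHER_VCORES = 1
LAUNCHER_RAM_GB = 1.5


def launcher_overhead(concurrent_actions):
    """Return (vcores, RAM in GB) held by the launcher tasks alone.

    Assumption: every concurrently running action has its own launcher.
    """
    return (concurrent_actions * LAUNCHER_VCORES,
            concurrent_actions * LAUNCHER_RAM_GB)


vcores, ram_gb = launcher_overhead(4)
print(f"4 concurrent actions: {vcores} vcores, {ram_gb} GB RAM for launchers")
```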
Let's look at this combination using a MapReduce job as an example, launched from the workflow's Java program action.

This is what the workflow looks like. The Jar name field holds the path to the executable jar file in HDFS. There is no need to copy the jar to every machine in the cluster; placing it in HDFS is enough. The Main class field holds the name of the main class. Next come the parameters. In this task I parameterized everything, but you do not have to turn every argument into a variable. One very important point deserves attention. Some of the arguments are passed in the form -D parameter_name=value, and some are received as plain string arguments in the args array. All the -D arguments must be listed first, followed by the plain string arguments. If the two kinds are mixed, they are interpreted incorrectly. For example, if some -D arguments come first, then plain arguments, and then more -D arguments, the second group of -D arguments is treated as plain string arguments. I spent a lot of time before I tracked this behavior down. Now let's create a coordinator based on this workflow.

It is created in the same way as in the previous example.
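The ordering rule for -D arguments described above can be modeled in a few lines. This is a simplified model of the observed behavior, not Hadoop's actual parser: it assumes that option parsing stops at the first plain argument, and the property names (input.dir, reducers) are hypothetical.

```python
def split_generic_options(args):
    """Collect leading -D options and stop at the first plain argument.

    Returns (properties, remaining_args). Everything after the first plain
    argument is left untouched, including any later -D options.
    """
    props = {}
    i = 0
    while i < len(args):
        arg = args[i]
        if arg == "-D" and i + 1 < len(args):
            pair = args[i + 1]          # form: -D name=value
            i += 2
        elif arg.startswith("-D") and len(arg) > 2:
            pair = arg[2:]              # form: -Dname=value
            i += 1
        else:
            break                       # first plain argument ends parsing
        name, _, value = pair.partition("=")
        props[name] = value
    return props, args[i:]


# All -D options first: both properties are recognized.
ordered = split_generic_options(
    ["-D", "input.dir=/in", "-D", "reducers=4", "2016", "5"])
print(ordered)

# Mixed order: the second -D option ends up among the plain arguments.
mixed = split_generic_options(
    ["-D", "input.dir=/in", "2016", "-D", "reducers=4"])
print(mixed)
```

In the mixed case the main class would receive "-D" and "reducers=4" as ordinary strings in its argument array, which matches the symptom described in the text.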
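HUE + Spark
The workflow is created as in the previous example, and so is the coordinator. There is one peculiarity: if the launch fails with an error about the main class even though the jar file has been placed in HDFS, the jar has to be copied to every machine in the cluster, into the same directory path as in HDFS.
This is what a branching workflow looks like: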

The "magic hand" at the start of the workflow is a decision node. Switch to edit mode to see what the magic hand turns into.

And finally, a few closing words...
Personally, I find that HUE has made working in the Hadoop ecosystem more comfortable, even though I do not use half of its features. I have said nothing about the other HUE combinations, either because they are very simple, like the one with Shell, or because I have not used them and have nothing to say. I also did not cover the query editors, the Metastore Manager or Search. Thank you for your attention.