æè¿ãOpenDataScienceãšMail.Ru Groupã¯ããªãŒãã³ãªæ©æ¢°åŠç¿ã³ãŒã¹ãå®æœããŸããã ååã®çºè¡šã§ã¯ ãã³ãŒã¹ã«ã€ããŠå€ãã®ããšãèšãããŸããã ãã®èšäºã§ã¯ãã³ãŒã¹è³æãå ±æãããšãšãã«ãæ°ããããŒã³ããçºè¡šããŸãã
UPDïŒçŸåšãã³ãŒã¹ã¯è±èªã§ã mlcourse.aiãšãããã©ã³ãåã§ãMedium ã«é¢ããèšäº ãKaggleïŒ Dataset ïŒããã³GitHubã«é¢ããè³æããããŸã ã
åŸ ãŠãªã人ïŒã³ãŒã¹ã®æ°ããç«ã¡äžãã¯2æ1æ¥ã§ããç»é²ã¯å¿ èŠãããŸããããç§ãã¡ã¯ããªããèŠããŠãå¥ã ã«æåŸ ããŠã ãã©ãŒã ã«èšå ¥ããŠãã ããã ãã®ã³ãŒã¹ã¯ãHabréã®äžé£ã®èšäºã§æ§æããïŒ Pandasã§ã®æåã®ããŒã¿åæãæåã§ãïŒã YouTubeãã£ã³ãã«ã§ã®è¬çŸ©ãåçŸå¯èœãªè³æïŒã³ãŒã¹ã®githubãªããžããªã®JupyterããŒãïŒã宿é¡ãKaggle Inclassã³ã³ãã¹ãããã¥ãŒããªã¢ã«ãåå¥ãããžã§ã¯ãããŒã¿åæã ã¡ã€ã³ãã¥ãŒã¹ã¯VKontakte ã°ã«ãŒãã«ãããã³ãŒã¹äžã®ç掻ã¯ãã£ã³ãã«#mlcourse_aiã® Slack OpenDataScienceïŒ join ïŒã«è¡šç€ºãããŸãã
èšäºã®æŠèŠ
- ã³ãŒã¹ãšä»ã®ã³ãŒã¹ãšã®éã
- ã³ãŒã¹è³æ
- æ°ãããªãªãŒã¹ã®è©³çŽ°ãã芧ãã ããã
ã³ãŒã¹ã¯ä»ã®ã³ãŒã¹ãšã©ãéããŸãã
1.åå¿è åãã§ã¯ãããŸãã
å€ãã®å Žåã圌ãã¯ããªãã«äœãå¿ èŠãšãããªãããšãæããŠãããŸãã2ã3ã¶æã§ããªãã¯ããŒã¿åæã®å°é家ã«ãªãã§ãããã Andrew Ngã®åºæ¬ã³ãŒã¹ãMachine Learningãã®ãã¬ãŒãºãä»ã§ãèŠããŠããŸãããå°é¢æ°ãäœã§ããããç¥ãå¿ èŠã¯ãããŸãããæ©æ¢°åŠç¿ã§æé©åã¢ã«ãŽãªãºã ãã©ã®ããã«æ©èœããããç解ã§ããŸããã ãŸãã¯ããããªãã¯ãã§ã«ã»ãšãã©ããŒã¿åæã®å°é家ã§ãããªã©ã ææã倧ãã«å°æ¬ããŸã-ããã¯é£ããããŒã±ãã£ã³ã°ãšé»undã§ãã ãã¿ã³ãšç·åœ¢ä»£æ°ã®åºç€ã§ããå°é¢æ°ã®ç¥èããªããã°æé©åãç解ã§ããŸããïŒ ã»ãšãã©ã®å Žåãããã€ãã®ã³ãŒã¹ïŒç§ãã¡ã®ã³ãŒã¹ãå«ãïŒãä¿®äºãããšãããã«ããŒã¿ãµã€ãšã³ãã£ã¹ãã«ãªãããšãããããŸããã ããã¯ç°¡åã§ã¯ãããŸããããããŠããªãã®åå以äžã¯çŽ3-4é±éã§èœã¡ãã§ãããã ããªããæãã§ããããæ°åŠãšããã°ã©ãã³ã°ã«æ²¡é ããæºåãã§ããŠããªãå Žåã¯ãæ°åŒã§æ©æ¢°åŠç¿ã®çŸãããèŠãŠãæ°åè¡ãšæ°çŸè¡ã®ã³ãŒããå°å·ããŠçµæãéæããŠãã ãã-ããªãã¯ããã«ããŸããã ããããããã§ãã¹ãŠåãããšãé¡ã£ãŠããŸãã
äžèšã«é¢é£ããŠãå ¥éã®ãããå€-åºæ¬çãªïŒãããæªããªãïŒã¬ãã«ã§ã®é«çæ°åŠã®ç¥èãšPythonã®åºæ¬ã®ææã瀺ããŸãã ãŸã æºåããŠããªãå Žåã®æºåæ¹æ³ã«ã€ããŠã¯ãVKontakteã°ã«ãŒãã§è©³ãã説æããŸããããã§ã¯ããã¿ãã¬ã®äžã«ãããŸãã ååãšããŠãæ°åŠãªãã§ã³ãŒã¹ãåè¬ã§ããŸããã次ã®å³ãåç §ããŠãã ããã ãã¡ãããç§åŠè ãæ°åŠãç¥ãå¿ èŠãããæ¥ä»ã¯ããªããŒã§ãããããã§ã¯Andrei Karpatyã®åŽã«ããŸã ã ã¯ããããªã㯠backprop ãç解ããå¿ èŠããããŸã ã ãŸããæ°åŠããŸã£ãããªããã°ãããŒã¿ãµã€ãšã³ã¹ã¯ããã«ã§ãœãŒããããããªãã®ã§ããåé¡ã解決ããããšã¯ã§ããŸãããããè¯ããããéããããã¹ããŒãã«è¡ãããšãã§ããŸãã ãã¡ãããæ°åŠããªããã°ãæå 端æè¡ã«å°éããããããèŠãããšã¯éåžžã«ãšããµã€ãã£ã³ã°ã§ãã
æ°åŠ
- è¿ éã§ããã°ãã³ãŒã¹ã©ã®YandexãšMIPTã®å°éåéã®ã¡ã¢ã確èªã§ããŸãïŒèš±å¯ãåŸãŠå ±æããŠããŸãïŒã
- åé¡ã«åŸ¹åºçã«åãçµãå ŽåãMIT Open Coursewareãžã®ãªã³ã¯ã¯1ã€ã§ååã§ãã ãã·ã¢èªã§ã®ã¯ãŒã«ãªãœãŒã¹ã¯ãããã±ãŒåŠéšWikiããŒãžã§ãã ããããç§ã¯MIPT ããã°ã©ã 2ã³ãŒã¹ãåè¬ããã¡ã€ã³ã®ã¿ã¹ã¯ããã¯ãèªã¿ãŸãããæå°éã®çè«ãšå€ãã®å®è·µããããŸãã
- ãããŠãã¡ãããè¯ãæ¬ã«ä»£ãããã®ã¯äœããããŸããïŒããã§ã¯SHADããã°ã©ã ã«èšåã§ããŸãïŒã
- æ°åŠçåæ-ã¯ããªã£ããã§ã;
- ç·åœ¢ä»£æ°-ã³ã¹ããªãã³;
- æé©å-ãã€ãïŒè±èªïŒ;
- 確ççè«ãšçµ±èš-ãããºã³ã
Python
- ã¯ã€ãã¯ãªãã·ã§ã³ã¯ãCodeAcademyãDatacampãDataquestãªã©ã®ãã©ãŠã¶ãã¥ãŒããªã¢ã«ã§ã ãããã«ãªããžããªãæå®ã§ããŸã ã
- 培åº-ããšãã°ãCoureraã®Mailra ã³ãŒã¹ãMIT-shny ã³ãŒã¹ ãPythonã䜿çšããã³ã³ãã¥ãŒã¿ãŒãµã€ãšã³ã¹ãšããã°ã©ãã³ã°ã®çŽ¹ä»ãã
- äžçŽã¬ãã«-ãµã³ã¯ãããã«ãã«ã¯ã³ã³ãã¥ãŒã¿ãŒãµã€ãšã³ã¹ã»ã³ã¿ãŒã³ãŒã¹ ã
2. çè«ãš å®è·µçè«ãšå®è·µ
å€ãã®æ©æ¢°åŠç¿ã³ãŒã¹ããããå°éåéïŒãæ©æ¢°åŠç¿ãšããŒã¿åæããªã©ïŒããããŸãããå€ãã¯æ¥µç«¯ããããã®ã®1ã€ã«ãªããŸãã ã
æé©ãªæ¯çãæ¢ããŠããŸãïŒHabréã®èšäºã«ã¯å€ãã®çè«ããããŸãïŒç·åœ¢ã¢ãã«ã«é¢ãã第4ã®èšäºã¯ææšã§ãïŒãå¯èœãªéãæ確ã«æ瀺ããããã«åªããè¬çŸ©ã§ããã«äººæ°ããããŸãã ããããæµ·ã®ç·Žç¿-宿é¡ã4ã€ã®Kaggleã³ã³ãã¹ãããããžã§ã¯ããªã©ãããã ãã§ã¯ãããŸããã
3.ã©ã€ãã³ãã¥ãã±ãŒã·ã§ã³
ã»ãšãã©ã®ã³ãŒã¹ã§æ¬ ããŠããã®ã¯ãã©ã€ãã³ãã¥ãã±ãŒã·ã§ã³ã§ãã åå¿è ã¯ãæ°æéãããã«ã¯æ°åæéãã®æéãç¯çŽããããã«ããã£ã1ã€ã®çãã¢ããã€ã¹ãå¿ èŠãšããå ŽåããããŸãã Courseraãã©ãŒã©ã ã¯éåžžãããæç¹ã§æ¶æ» ããŸãã ç§ãã¡ã®ã³ãŒã¹ã®ãŠããŒã¯ãã¯ãç©æ¥µçãªã³ãã¥ãã±ãŒã·ã§ã³ãšçžäºæ¯æŽã®é°å²æ°ã§ãã ã³ãŒã¹äžãSlack OpenDataScienceã¯ãããã質åããµããŒãããŸãããã£ããã¯çãçããšæé·ããŸããç¬èªã®ãŠãŒã¢ã¢ãããã誰ãã誰ããèãããŸãã
4. Kaggleã®åäœ
VKontakteã®å
¬é ãæ人ç·æ§åãã®æ©æ¢°åŠç¿ã«é¢ããããŒã ãããã
Kaggleã³ã³ããã£ã·ã§ã³ã¯ãããŒã¿ãã€ãã³ã°ã®å®è·µããã°ããäœéšããããã®åªããæ¹æ³ã§ãã éåžžã圌ãã¯åºæ¬çãªæ©æ¢°åŠç¿ã³ãŒã¹ãåè¬ããåŸã圌ãã«åå ãå§ããŸãïŒååãšããŠãAndrew Ngã³ãŒã¹ã¯ãèè ã¯ç¢ºãã«ã«ãªã¹ãæ§ããããéåžžã«ããŸã話ããŸãããã³ãŒã¹ã¯ãã§ã«éåžžã«æ代é ãã§ãïŒ ã³ãŒã¹ã®ã³ãŒã¹ã§ã¯ã4ã€ã®ã³ã³ããã£ã·ã§ã³ã«åå ããããæåŸ ãããŸãããã®ãã¡2ã€ã¯å®¿é¡ã®äžéšã§ãããã¢ãã«ããç¹å®ã®çµæãéæããå¿ èŠããããŸããä»ã®2ã€ã¯ãã§ã«æ¬æ Œçãªã³ã³ããã£ã·ã§ã³ã§ãããåäœæïŒãµã€ã³ã¢ãããã¢ãã«ã®éžæïŒããã³ãªãŒããŒãã€ã¯ãå¿ èŠã§ã仲éã
5.ç¡æ
ãŸãããããéèŠãªèŠçŽ ã§ããããã¯ãã§ã«ããã«ãããŸãã ããŠãæ©æ¢°åŠç¿ã®æ®åã«äŒŽããéåžžã«å¹ åºãå ±é ¬ãåŸãããã®æè²ãæäŸããå€ãã®ã³ãŒã¹ãèŠã€ãããŸãã ãããŠãããã§ã¯ãã¹ãŠãç¡æã§ãããåœãã®è¬èãããªããéåžžã«ãŸãšããªã¬ãã«ã§ãã
ã³ãŒã¹è³æ
ããã§ã¯ãã³ãŒã¹ã®10ã®ãããã¯ãããããäœã«å°å¿µããŠããã®ããåºæ¬çãªæ©æ¢°åŠç¿ã³ãŒã¹ãããããªãã§ã¯ã§ããªãçç±ãããã³å°å ¥ãããæ°ããäºé ã«ã€ããŠç°¡åã«èª¬æããŸãã
ãããã¯1. Pandasã䜿çšããåæããŒã¿åæã Habréã«é¢ããèšäº
ããã«æ©æ¢°åŠç¿ããå§ããŠãå®éã®æ°åŠãèŠãŠã¿ãŸãããã ããããå®éã®ãããžã§ã¯ãã§äœæ¥ããæéã®70ã80ïŒ ã¯ããŒã¿ã«ç ©ããããŠãããããã§ã¯Pandasãéåžžã«åªããŠããã®ã§ãã»ãŒæ¯æ¥ä»äºã§äœ¿çšããŠããŸãã ãã®èšäºã§ã¯ãäžæ¬¡ããŒã¿åæã®ããã®ãã³ãã®åºæ¬çãªæ¹æ³ã«ã€ããŠèª¬æããŸãã 次ã«ãéä¿¡äºæ¥è ã®é¡§å®¢ã®æµåºã«é¢ããããŒã¿ã»ãããåæããåã«åžžèã«é Œã£ãŠããã¬ãŒãã³ã°ãªãã§æµåºããäºæž¬ãããããšããŸãã ãã®ã¢ãããŒããéå°è©äŸ¡ããŠã¯ãããŸããã
ãããã¯2. Pythonã«ããããžã¥ã¢ã«ããŒã¿åæã Habréã«é¢ããèšäº
èŠèŠçãªããŒã¿åæã®åœ¹å²ãé倧è©äŸ¡ããããšã¯å°é£ã§ããããããæ°ããå åã®äœææ¹æ³ãããŒã¿ã®ãã¿ãŒã³ãšæŽå¯ã®æ€çŽ¢æ¹æ³ã§ãã K.V. Vorontsovã¯ãèŠèŠåã®ãããã§ãããŒã¹ãäžã«ããªãŒãè¿œå ãããã«ã€ããŠã¯ã©ã¹ããé¢ããŠãããããšãèªèãããã®äºå®ãçè«çã«èšŒæãããäŸã瀺ããŠããŸãã è¬çŸ©ã§ã¯ãæšèã®åæã®ããã«éåžžäœæãããäž»ãªã¿ã€ãã®åçã«ã€ããŠæ€èšããŸãã ãŸããäžè¬çãªå€æ¬¡å 空éãèŠãæ¹æ³ã«ã€ããŠã説æããŸããt-SNEã¢ã«ãŽãªãºã ã䜿çšãããšããã®ãããªã¯ãªã¹ãã¹ããªãŒã®è£ 食ãæãã®ã«åœ¹ç«ã¡ãŸãã
ããŒã3.åé¡ã決å®æšãããã³æè¿åã®æ¹æ³ã
Habréã«é¢ããèšäº
ããã§ã¯ãæ©æ¢°åŠç¿ãšãåé¡åé¡ã解決ãã2ã€ã®ç°¡åãªã¢ãããŒãã«ã€ããŠèª¬æããŸãã ç¹°ãè¿ããŸãããå®éã®ãããžã§ã¯ãã§ã¯ãæãåçŽãªã¢ãããŒãããå§ããå¿ èŠãããããã¥ãŒãªã¹ãã£ãã¯ã®åŸã«æåã«è©Šãå¿ èŠãããã®ã¯ã決å®æšãšæè¿åæ³ïŒããã³ç·åœ¢ã¢ãã«ã次ã®ãããã¯ïŒã§ãã ã¢ãã«ã®å質è©äŸ¡ãšçžäºæ€èšŒã®éèŠãªåé¡ã«è§ŠããŸãã æšã®é·æãšçæãããã³æè¿åã®æ¹æ³ã«ã€ããŠè©³ãã説æããŸãã ãã®èšäºã¯é·ããªããŸãããç¹ã«ææ決å®ããªãŒã«æ³šç®ããå¿ èŠããããŸã-ã©ã³ãã ãã©ã¬ã¹ããšããŒã¹ãã£ã³ã°ãæ§ç¯ãããã®ã¯ããããã®åºç€ã«åºã¥ããŠãã-å®éã«äœ¿çšããå¯èœæ§ãæãé«ãã¢ã«ãŽãªãºã ã§ãã
ããŒã4.åé¡ãšååž°ã®ç·åœ¢ã¢ãã«ã
Habréã«é¢ããèšäº
ãã®èšäºã¯æ¢ã«å°ããªãã³ãã¬ããã®ãµã€ãºã«ãªããŸããããã®çç±ã¯ååã«ãããŸããç·åœ¢ã¢ãã«ã¯ãå®éã®äºæž¬ã§æãåºã䜿çšãããŠããã¢ãããŒãã§ãã ãã®èšäºã¯ãç§ãã¡ã®ãããã¥ã¢ã³ãŒã¹ã®ãããªãã®ã§ããå€ãã®çè«ãå€ãã®å®è·µã§ãã æå°äºä¹æ³ãšããžã¹ãã£ãã¯ååž°ã®çè«çèæ¯ãããã³ç·åœ¢ã¢ãã«ã®å®çšåã®å©ç¹ã«ã€ããŠèª¬æããŸãã ãŸããé床ã®çè«åã¯è¡ãããªãããšã«æ³šæããŠãã ãã;æ©æ¢°åŠç¿ã«ãããç·åœ¢ã¢ãã«ãžã®ã¢ãããŒãã¯ãçµ±èšçããã³èšéçµæžåŠçææ³ãšã¯ç°ãªããŸãã å®éã«ã¯ã蚪åãããµã€ãã®ã·ãŒã±ã³ã¹ã«ãã£ãŠãŠãŒã¶ãŒãèå¥ãããšããéåžžã«çŸå®çãªã¿ã¹ã¯ã«ããžã¹ãã£ãã¯ååž°ãé©çšããŸãã 4åç®ã®å®¿é¡ã®åŸãå€ãã®äººãè±èœããŸãããããã§ãåãããšãããã°ãå®çšŒåã·ã¹ãã ã§ã©ã®ã¢ã«ãŽãªãºã ã䜿çšãããŠãããã«ã€ããŠæ¢ã«ååã«ç解ã§ããŸãã
ããŒã5.æ§æïŒãã®ã³ã°ãã©ã³ãã ãã©ã¬ã¹ãã Habréã«é¢ããèšäº
ããã§ããçè«ã¯èå³æ·±ããå®è·µçã§ãã ã矀è¡ã®ç¥æµããæ©æ¢°åŠç¿ã¢ãã«ã§æ©èœããçç±ã説æããŸããå€ãã®ã¢ãã«ã1ã€ãããåªããŠãããæé«ã®ã¢ãã«ã§ãæ©èœããŸãã ããããå®éã«ã¯ãã©ã³ãã ãã©ã¬ã¹ãïŒå€ãã®æ±ºå®ããªãŒã®æ§æïŒãåžããŸã-ã©ã®ã¢ã«ãŽãªãºã ãéžæãã¹ããããããªãå Žåã¯ãè©ŠããŠã¿ã䟡å€ããããŸãã ã©ã³ãã ãã©ã¬ã¹ãã®å€ãã®å©ç¹ãšãã®ç¯å²ã«ã€ããŠè©³ãã説æããŸãã ãããŠããã€ãã®ããã«ãæ¬ ç¹ããªãããã§ã¯ãããŸãããç·åœ¢ã¢ãã«ãããè¯ããããéãåäœããç¶æ³ããŸã ãããŸãã
ããŒã6.æšèã®äœæãšéžæã ããã¹ããç»åããžãªããŒã¿ã®åŠçã¿ã¹ã¯ã®ã¢ããªã±ãŒã·ã§ã³ã Habréã«é¢ããèšäº ãååž°ãšæ£ååã«é¢ããè¬çŸ©ã
ããã§ã¯ãèšäºãšè¬çŸ©ã®èšç»ãå°ãç°ãªããŸãïŒäžåºŠã ãïŒãç·åœ¢ã¢ãã«ã®4çªç®ã®ãããã¯ã¯å€§ããããŸãã ãã®èšäºã§ã¯ãæ©æ¢°åŠç¿ã¢ãã«ã®æ©èœã®æœåºãå€æãæ§ç¯ã«å¯Ÿããäž»ãªã¢ãããŒãã«ã€ããŠèª¬æããŸãã äžè¬ã«ããã®ã¬ãã¹ã³ã§ããæšèã®äœæã¯ãããŒã¿ãµã€ãšã³ãã£ã¹ãã®ä»äºã®æãåµé çãªéšåã§ãã ãããŠãã¡ãããæ¢è£œã®PandasããŒã¿ãã¬ãŒã ã ãã§ãªããããŸããŸãªããŒã¿ïŒããã¹ããç»åããžãªããŒã¿ïŒãæäœããæ¹æ³ãç¥ãããšãéèŠã§ãã
è¬çŸ©ã§ã¯ãç·åœ¢ã¢ãã«ãšãMLã¢ãã«ã®è€éããèšå®ããããã®äž»ãªææ³ã§ããæ£ååã«ã€ããŠå床説æããŸãã ããã£ãŒãã©ãŒãã³ã°ããšããæ¬ã¯ãããç¥ãããŠããåå¿ïŒèšŒæãªã³ã¯ãç»ãã®ãé¢åïŒãæããŠãããäžè¬ã«ããã¹ãŠã®æ©æ¢°åŠç¿ã¯æ£ååã®æ¬è³ªãã§ãããšäž»åŒµããŠããŸãã ãã¡ããããã¯èªåŒµã§ãããå®éã«ã¯ãã¢ãã«ãããŸãæ©èœããããã«ã¯ãã¢ãã«ã調æŽããå¿ èŠããããŸããã€ãŸããæ£ååã䜿çšããã®ãé©åã§ãã
ããŒã7.æåž«ãªãã®æè²ïŒPCAãã¯ã©ã¹ã¿ãªã³ã°ã Habréã«é¢ããèšäº
ããã§ã¯ãæåž«ãªãã§æãããšããåºå€§ãªãããã¯ã«ç®ãåããŸããããã¯ããŒã¿ããããšãã§ãããäºæž¬ãããã¿ãŒã²ããå±æ§ã¯ãããŸããã ãã®ãããªæªå²ãåœãŠããŒã¿ã¯1ããŒã¹ã§ããããããããå©çãåŸãããšãã§ããå¿ èŠããããŸãã ã¯ã©ã¹ã¿ãªã³ã°ãšæ¬¡å åæžã®2çš®é¡ã®ã¿ã¹ã¯ã«ã€ããŠã®ã¿èª¬æããŸãã 宿é¡ã§ã¯ãæºåž¯é»è©±ã®å é床èšãšãžã£ã€ãã¹ã³ãŒãããã®ããŒã¿ãåæãããããã«é»è©±ãã£ãªã¢ãã¯ã©ã¹ã¿ãŒåããŠãã¢ã¯ãã£ããã£ã®çš®é¡ã匷調ããŸãã
ãããã¯8. Vowpal Wabbitã䜿çšããã®ã¬ãã€ãã®ãã¬ãŒãã³ã°ã Habréã«é¢ããèšäº
ããã§ã®çè«ã¯ç¢ºççåŸé éäžæ³ã®åæã§ããããã®æé©åææ³ã«ããã倧èŠæš¡ãªãã¬ãŒãã³ã°ãµã³ãã«ã§ãã¥ãŒã©ã«ãããã¯ãŒã¯ãšç·åœ¢ã¢ãã«ã®äž¡æ¹ãæ£åžžã«ãã¬ãŒãã³ã°ããããšãå¯èœã«ãªããŸããã ããã§ã¯ãå åãå€ãããå Žåã®å¯ŸåŠæ¹æ³ïŒå±æ§ã®å€ãããã·ã¥ããããªãã¯ïŒã«ã€ããŠèª¬æããæ°åã§ã®ã¬ãã€ãã®ããŒã¿ã§ã¢ãã«ããã¬ãŒãã³ã°ã§ãããŠãŒãã£ãªãã£ã§ããVowpal Wabbitã«ç§»åããŸãã çãããã¹ãã®åé¡ãããã³StackOverflowã«é¢ãã質åã®åé¡ãªã©ãããŸããŸãªã¿ã¹ã¯ã®å€ãã®ã¢ããªã±ãŒã·ã§ã³ãæ€èšããŠãã ããã ãããŸã§ã®ãšããããã®ç¹å®ã®èšäºã®ç¿»èš³ ïŒKaggle Kernelã®åœ¢åŒïŒã¯ãè±èªããäžçšåºŠã®çŽ æãæåºããæ¹æ³ã®äŸãšããŠåœ¹ç«ã¡ãŸãã
ãããã¯9. Pythonã䜿çšããæç³»ååæã Habréã«é¢ããèšäº
ããã§ã¯ãã¢ãã«ã«å¿ èŠãªããŒã¿æºåã®æ®µéãçæããã³é·æã®äºæž¬ãååŸããæ¹æ³ãªã©ãæç³»åãæ±ãããŸããŸãªæ¹æ³ã«ã€ããŠèª¬æããŸãã åçŽãªç§»åå¹³åããåŸé ããŒã¹ãã£ã³ã°ãŸã§ãããŸããŸãªã¿ã€ãã®ã¢ãã«ãèŠãŠãããŸãããã ãŸããæç³»åã§ç°åžžãæ€çŽ¢ããæ¹æ³ãæ€èšãããããã®æ¹æ³ã®å©ç¹ãšæ¬ ç¹ã«ã€ããŠèª¬æããŸãã
ããŒã10.åŸé ããŒã¹ãã£ã³ã°ã Habréã«é¢ããèšäº
ããŠãåŸé ããŒã¹ããªãã®å Žå...ããã¯MatrixnetïŒYandexæ€çŽ¢ãšã³ãžã³ïŒãããã³Catboost-Yandexã®æ°äžä»£ããŒã¹ãã£ã³ã°ãããã³æ€çŽ¢ãšã³ãžã³Mail.Ruã§ãã ããŒã¹ãã£ã³ã°ã¯ãæåž«ã«ããæè²ã®3ã€ã®åºæ¬ã¿ã¹ã¯ïŒåé¡ãååž°ãã©ã³ãã³ã°ïŒããã¹ãŠè§£æ±ºããŸãã äžè¬ã«ãç§ã¯ãããæè¯ã®ã¢ã«ãŽãªãºã ãšåŒã³ãããšæããŸããããã¯çå®ã«è¿ãã§ãããããè¯ãã¢ã«ãŽãªãºã ã¯ãããŸããã ãã ããããŒã¿ãå€ãããïŒRAMã«åãŸãïŒãããŸãå€ãã®å åïŒæ°åãŸã§ïŒããªããå åãç°è³ªïŒã«ããŽãªãå®éããã€ããªãªã©ïŒã§ããå ŽåãKaggle競åã®çµéšã瀺ãããã«ã ã»ãŒç¢ºå®ã«ãåŸé ããŒã¹ãã£ã³ã°ãããªãã®ã¿ã¹ã¯ã«æé©ã§ãã ãããã£ãŠãXgboostãLightGBMãCatboostãH2Oãªã©ãéåžžã«å€ãã®ã¯ãŒã«ãªå®è£ ãç»å Žããã®ã¯çç±ããªãã£ãããã§ã¯ãããŸãã...
ç¹°ãè¿ãã«ãªããŸããããXxBustã®ãã¥ãŒãã³ã°æ¹æ³ãããã¥ã¢ã«ã«éå®ãããããšã¯ãããŸããããããŒã¹ãã®çè«ã詳现ã«æ€èšããå®éã«æ€èšããŠãCatboostã®è¬çŸ©ã§åãäžããŸãã ããã§ã®ã¿ã¹ã¯ã¯ã競äºã®ããŒã¹ã©ã€ã³ãç Žãããšã§ããããã«ãããå€ãã®å®éçãªåé¡ã§æ©èœããæ¹æ³ã®è¯ãã¢ã€ãã¢ãåŸãããŸãã
æ°ãããªãªãŒã¹ã®è©³çŽ°ãã芧ãã ããã
ã³ãŒã¹ã¯2018幎2æ5æ¥ã«å§ãŸããŸãã ã³ãŒã¹æéäžïŒ
- Mail.Ru Groupã®ã¢ã¹ã¯ã¯äºåæã§ã®æææ¥19ïŒ00-22.00ã®ã©ã€ãã¬ã¯ãã£ãŒã è¬çŸ©ã®ãããªé²ç»ã¯åãã§ã ïŒyoutubeïŒããã³ã¡ã³ããè¿œå ãæ¹åããããŸãã
- Habrã«é¢ããèšäºã¯å€ãã ãããæåã§ãã èšäºã¯çŸåšã®å®¿é¡ãšãããã®ç· ãåããçºè¡šããæ å ±ã¯VKontakte ã°ã«ãŒããšSlack OpenDataScienceã®#mlcourse_aiãã£ã³ãã«ã§è€è£œãããŸãã
- ã³ã³ãã¹ãããããžã§ã¯ãããã¥ãŒããªã¢ã«ããã®ä»ã®ã¢ã¯ãã£ããã£ã«ã€ããŠã¯ã ãã®èšäºãšã³ãŒã¹ãªããžããªã§èª¬æããŠã ãŸã ã
- ãŸããé±ã«1åãMediumã§è±èªã®èšäºãå ¬éããŸãã ããã¯ãVowpal Wabbitã«ã€ããŠã®ãã® Kaggleã«ãŒãã«ã«äŒŒãŠããŸãããMediumã®ã¿ã§ãã
- 4æ23æ¥ãã7æ15æ¥ãŸã§ããã¥ãŒã©ã«ãããã¯ãŒã¯ã«é¢ããå ±åã®ã¹ã¿ã³ãã©ãŒãcs231n ã³ãŒã¹ãèšç»ãããŠããŸãïŒè©³çŽ°ã«ã€ããŠã¯ãODSã¹ã©ãã¯ã®ãã£ãã«ïŒ class_cs231nã®åºå®é ç®ãåç §ããŠãã ããïŒã ããã¯2åç®ã®æã¡äžãã«ãªããŸãããä»ã¯ã¡ããã©é²ãã§ãããã³ãŒã¹ã¯çŽ æŽãããã§ãã宿é¡ã¯é£ãããé¢çœããéåžžã«äŸ¿å©ã§ãã
ã³ãŒã¹ãžã®æ¥ç¶æ¹æ³
æ£åŒãªç»é²ã¯å¿ èŠãããŸããã 宿é¡ãããŠã競æäŒã«åå ããã ãã§ãã©ã³ãã³ã°ã§ããªããèæ ®ããŸãã ããã§ãã ãã®èª¿æ»ã«èšå ¥ãããšãã³ãŒã¹äžã«å·Šã®ã¡ãŒã«ãããªãã®IDã«ãªããŸããåæã«ããã€ã³ãã«è¿ãã¹ã¿ãŒããæãåºãããŸãã
ãã£ã¹ã«ãã·ã§ã³ãã©ãããã©ãŒã
- Slack OpenDataScienceã®#mlcourse_aiãã£ãã«ã ããã§ã®äž»ãªé£çµ¡ã¯ã質åãããããšãã§ããŸãã åãæ-èšäºã宿é¡ã®èè ããã®ãã£ã³ãã«ã«ããŸããçããæºåãã§ããŠããŸãã ãããã措氎ãå€ãã®ââã§ã質åããåã«åºå®ãããã¢ã€ãã ãèŠãŠãã ããã
- VKontakte ã°ã«ãŒã ã ãã®å£ã¯å ¬åŒçºè¡šã«äŸ¿å©ãªå Žæã§ãã
é 匵ã£ãŠ æåŸã«ãç§ã¯ãã¹ãŠãå€æããããšãäž»ãªããšãèšããã-çµäºããªãã§ãã ããïŒ ãã®ãæããŠã¯ãããªããããªãã¯ä»äžç®ãèµ°ããããããæ°ã¥ããªãã£ãã ããããèããŠã¿ãŠãã ããïŒãããäž»ãªããšã§ãã