浠婃棩Nature: 浜哄伐鏅鸿兘浠?鍒?, 鏃犲笀鑷€氬畬鐖嗛樋娉曠嫍100-0|娣卞害瑙f瀽

杩欓噷鏄爣棰樹竴h1鍗犱綅鏂囧瓧


銆€銆€鍘诲勾锛屾湁涓皬瀛╄閬嶄汉涓栨墍鏈夌殑妫嬭氨锛岃緵鍕ゆ墦璋憋紝鑻︽€濆啣鎯筹紝妫嬭壓绮捐繘锛?-1鎵撹触涓栫晫鍐犲啗鏉庝笘鐭筹紝浠庢浜洪棿鏃犳晫鎵嬨€備粬鐨勫悕瀛楀彨闃挎硶鐙椼€侟/div>
銆€銆€浠婂勾锛屼粬鐨勫紵寮熷彧闈犱竴鍓鐩樺拰榛戠櫧涓ゅ瓙锛屾病鐪嬭繃涓€涓璋憋紝涔熸病鏈変竴涓汉鎸囩偣锛屼粠闆跺紑濮嬶紝鑷ū鑷箰锛岃嚜宸卞弬鎮燂紝100-0鎵撹触鍝ュ摜闃挎硶鐙椼€備粬鐨勫悕瀛楀彨闃挎硶鍏冦€侟/div>
銆€銆€DeepMind杩欓」浼熷ぇ鐨勭獊鐮达紝浠婂ぉ浠astering the game of Go without human knowledge涓洪锛屽彂琛ㄤ簬Nature锛屽紩璧疯桨鍔ㄣ€傜煡绀剧壒閭€鍥藉唴澶栧嚑浣嶄汉宸ユ櫤鑳戒笓瀹讹紝缁欎簣娣卞害瑙f瀽鍜岀偣璇勩€傛枃鏈湁DeepMind David Silver鍗氬+涓撹瑙嗛銆傜壒鍒嚧璋ature鍜孌eepMind鎻愪緵璁伅鍜岃祫鏂欐巿鏉冦€侟/div>
 
 
銆€銆€Nature浠婂ぉ涓婄嚎鐨勮繖绡囬噸纾呰鏂囷紝璇︾粏浠嬬粛浜嗚胺姝孌eepMind鍥㈤槦鏈€鏂扮殑鐮旂┒鎴愭灉銆備汉宸ユ櫤鑳界殑涓€椤归噸瑕佺洰鏍囷紝鏄湪娌℃湁浠讳綍鍏堥獙鐭ヨ瘑鐨勫墠鎻愪笅锛岄€氳繃瀹屽叏鐨勮嚜瀛︼紝鍦ㄦ瀬鍏锋寫鎴樼殑棰嗗煙锛岃揪鍒拌秴浜虹殑澧冨湴銆傚幓骞达紝闃挎硶鐙楋紙AlphaGo锛変唬琛ㄤ汉宸ユ櫤鑳藉湪鍥存棰嗗煙棣栨鎴樿儨浜嗕汉绫荤殑涓栫晫鍐犲啗锛屼絾鍏舵鑹虹殑绮捐繘锛屾槸寤虹珛鍦ㄨ绠楁満閫氳繃娴烽噺鐨勫巻鍙叉璋卞涔犲弬鎮熶汉绫绘鑹虹殑鍩虹涔嬩笂锛岃繘鑰岃嚜鎴戣缁冿紝瀹炵幇瓒呰秺銆侟/div>
銆€銆€
 
闃垮皵娉曞厓妫嬪姏鐨勫闀夸笌绉垎姣擖/div>
銆€銆€鍙槸浠婂ぉ锛屾垜浠彂鐜帮紝浜虹被鍏跺疄鎶婇樋娉曠嫍鏁欏潖浜嗭紒鏂颁竴浠g殑闃挎硶鍏?AlphaGo Zero),瀹屽叏浠庨浂寮€濮嬶紝涓嶉渶瑕佷换浣曞巻鍙叉璋辩殑鎸囧紩锛屾洿涓嶉渶瑕佸弬鑰冧汉绫讳换浣曠殑鍏堥獙鐭ヨ瘑锛屽畬鍏ㄩ潬鑷繁涓€涓汉寮哄寲瀛︿範锛坮einforcement learning锛夊拰鍙傛偀,妫嬭壓澧為暱杩滆秴闃挎硶鐙楋紝鐧炬垬鐧捐儨锛屽嚮婧冮樋娉曠嫍100-0銆侟/div>
銆€銆€杈惧埌杩欐牱涓€涓按鍑嗭紝闃挎硶鍏冨彧闇€瑕佸湪4涓猅PU涓婏紝鑺变笁澶╂椂闂达紝鑷繁宸﹀彸浜掓悘490涓囨灞€銆傝€屽畠鐨勫摜鍝ラ樋娉曠嫍锛岄渶瑕佸湪48涓猅PU涓婏紝鑺卞嚑涓湀鐨勬椂闂达紝瀛︿範涓夊崈涓囨灞€锛屾墠鎵撹触浜虹被銆侟/div>
 
 
銆€銆€杩欑瘒璁烘枃鐨勭涓€鍜岄€氳浣滆€呮槸DeepMind鐨凞avid Silver鍗氬+,闃挎硶鐙楅」鐩礋璐d汉銆備粬浠嬬粛璇撮樋娉曞厓杩滄瘮闃挎硶鐙楀己澶э紝鍥犱负瀹冧笉鍐嶈浜虹被璁ょ煡鎵€灞€闄愶紝鑰岃兘澶熷彂鐜版柊鐭ヨ瘑锛屽彂灞曟柊绛栫暐锛欬/div>
銆€銆€This technique is more powerful than previous versions of AlphaGo because it is no longer constrained by the limits of human knowledge.Instead,it is able to learn tabula rasa from the strongest player in the world:AlphaGo itself.AlphaGo Zero also discovered new knowledge,developing unconventional strategies and creative new moves that echoed and surpassed the novel techniques it played in the games against Lee Sedol and Ke Jie.
 
 
銆€銆€DeepMind鑱斿悎鍒涘浜哄拰CEO鍒欒杩欎竴鏂版妧鏈兘澶熺敤浜庤В鍐宠濡傝泲鐧借川鎶樺彔鍜屾柊鏉愭枡寮€鍙戣繖鏍风殑閲嶈闂锛欬/div>
銆€銆€AlphaGo Zero is now the strongest version of our program and shows how much progress we can make even with less computing power and zero use of human data.Ultimately we want to harness algorithmic breakthroughs like this to help solve all sorts of pressing real world problems like protein folding or designing new materials.
銆€銆€缇庡浗鐨勪袱浣嶆鎵嬪湪Nature瀵归樋娉曞厓鐨勬灞€鍋氫簡鐐硅瘎锛氬畠鐨勫紑灞€鍜屾敹瀹樺拰涓撲笟妫嬫墜鐨勪笅娉曞苟鏃犲尯鍒紝浜虹被鍑犲崈骞寸殑鏅烘収缁撴櫠锛岀湅璧锋潵骞堕潪鍏ㄩ敊銆備絾鏄腑鐩樼湅璧锋潵鍒欓潪甯歌寮傦細
銆€銆€the AI鈥檚 open卢ing choices and end-game methods have converged on ours鈥攕eeing it arrive at our sequences from first principles suggests that we haven鈥檛 been on entirely the wrong track.By contrast,some of its middle-game judgements are truly mysterious.
銆€銆€涓烘洿娣卞叆浜嗚В闃挎硶鍏冪殑鎶€鏈粏鑺傦紝鐭ョぞ閲囪浜嗙編鍥芥潨鍏嬪ぇ瀛︿汉宸ユ櫤鑳戒笓瀹堕檲鎬$劧鏁欐巿銆備粬鍚戠煡绀句粙缁嶈锛欬/div>
銆€銆€DeepMind鏈€鏂版帹鍑虹殑AlphaGo Zero闄嶄綆浜嗚缁冨鏉傚害锛屾憜鑴变簡瀵逛汉绫绘爣娉ㄦ牱鏈?浜虹被鍘嗗彶妫嬪眬)鐨勪緷璧栵紝璁╂繁搴﹀涔犵敤浜庡鏉傚喅绛栨洿鍔犳柟渚垮彲琛屻€傛垜涓汉瑙夊緱鏈€鏈夎叮鐨勬槸璇佹槑浜嗕汉绫荤粡楠岀敱浜庢牱鏈┖闂村ぇ灏忕殑闄愬埗锛屽線寰€閮芥敹鏁涗簬灞€閮ㄦ渶浼樿€屼笉鑷煡锛堟垨鏃犳硶鍙戠幇锛夛紝鑰屾満鍣ㄥ涔犲彲浠ョ獊鐮磋繖涓檺鍒躲€備箣鍓嶅ぇ瀹堕殣闅愮害绾﹁寰楀簲璇ュ姝わ紝鑰岀幇鍦ㄦ槸閾佺殑閲忓寲浜嬪疄鎽嗗湪闈㈠墠锛?/div>
銆€銆€浠栬繘涓€姝ヨВ閲婇亾锛欬/div>
銆€銆€杩欑瘒璁烘枃鏁版嵁鏄剧ず瀛︿範浜虹被閫夋墜鐨勪笅娉曡櫧鐒惰兘鍦ㄨ缁冧箣鍒濊幏寰楄緝濂界殑妫嬪姏锛屼絾鍦ㄨ缁冨悗鏈熸墍鑳借揪鍒扮殑妫嬪姏鍗村彧鑳戒笌鍘熺増鐨凙lphaGo鐩歌繎锛岃€屼笉瀛︿範浜虹被涓嬫硶鐨凙lphaGo Zero鏈€缁堝嵈鑳借〃鐜板緱鏇村ソ銆傝繖鎴栬璇存槑浜虹被鐨勪笅妫嬫暟鎹皢绠楁硶瀵煎悜浜嗗眬閮ㄦ渶浼?local optima)锛岃€屽疄闄呮洿浼樻垨鑰呮渶浼樼殑涓嬫硶涓庝汉绫荤殑涓嬫硶瀛樺湪涓€浜涙湰璐ㄧ殑涓嶅悓锛屼汉绫诲疄闄呪€欒瀵尖€欎簡AlphaGo銆傛湁瓒g殑鏄鏋淎lphaGo Zero鏀惧純瀛︿範浜虹被鑰屼娇鐢ㄥ畬鍏ㄩ殢鏈虹殑鍒濆涓嬫硶锛岃缁冭繃绋嬩篃涓€鐩存湞鐫€鏀舵暃鐨勬柟鍚戣繘琛岋紝鑰屾病鏈変骇鐢熼毦浠ユ敹鏁涚殑鐜拌薄銆侟/div>
銆€銆€闃挎硶鍏冩槸濡備綍瀹炵幇鏃犲笀鑷€氱殑鍛紵鏉滃厠澶у鍗氬+鐮旂┒鐢熷惔鏄ラ箯鍚戠煡绀句粙缁嶄簡鎶€鏈粏鑺傦細
銆€銆€涔嬪墠鎴樿儨鏉庝笘鐭崇殑AlphaGo鍩烘湰閲囩敤浜嗕紶缁熷寮哄涔犳妧鏈啀鍔犱笂娣卞害绁炵粡缃戠粶DNN瀹屾垚鎼缓锛岃€孉lphaGo Zero鍚稿彇浜嗘渶鏂版垚鏋滃仛鍑轰簡閲嶅ぇ鏀硅繘銆侟/div>
銆€銆€棣栧厛锛屽湪AlphaGo Zero鍑虹幇涔嬪墠锛屽熀浜庢繁搴﹀涔犵殑澧炲己瀛︿範鏂规硶鎸夌収浣跨敤鐨勭綉缁滄ā鍨嬫暟閲忓彲浠ュ垎涓轰袱绫胡涓€绫讳娇鐢ㄤ竴涓狣NN"绔埌绔?鍦板畬鎴愬叏閮ㄥ喅绛栬繃绋?姣斿DQN)锛岃繖绫绘柟娉曟瘮杈冭交渚匡紝瀵逛簬绂绘暎鍔ㄤ綔鍐崇瓥鏇撮€傜敤;鍙︿竴绫讳娇鐢ㄥ涓狣NN鍒嗗埆瀛︿範policy鍜寁alue绛?姣斿涔嬪墠鎴樿儨鏉庝笘鐭崇殑AlphaGoGo)锛岃繖绫绘柟娉曟瘮杈冨鏉傦紝瀵逛簬鍚勭鍐崇瓥鏇撮€氱敤銆傛娆$殑AlphaGo Zero缁煎悎浜嗕簩鑰呴暱澶勶紝閲囩敤绫讳技DQN鐨勪竴涓狣NN缃戠粶瀹炵幇鍐崇瓥杩囩▼锛屽苟鍒╃敤杩欎釜DNN寰楀埌涓ょ杈撳嚭policy鍜寁alue锛岀劧鍚庡埄鐢ㄤ竴涓挋鐗瑰崱缃楁悳绱㈡爲瀹屾垚褰撳墠姝ラ閫夋嫨銆侟/div>
銆€銆€鍏舵锛孉lphaGo Zero娌℃湁鍐嶅埄鐢ㄤ汉绫诲巻鍙叉灞€锛岃缁冭繃绋嬩粠瀹屽叏闅忔満寮€濮嬨€傞殢鐫€杩戝嚑骞存繁搴﹀涔犵爺绌跺拰搴旂敤鐨勬繁鍏ワ紝DNN鐨勪竴涓己鐐规棩鐩婃槑鏄晋璁粌杩囩▼闇€瑕佹秷鑰楀ぇ閲忎汉绫绘爣娉ㄦ牱鏈紝鑰岃繖瀵逛簬灏忔牱鏈簲鐢ㄩ鍩?姣斿鍖荤枟鍥惧儚澶勭悊)鏄笉鍙兘鍔炲埌鐨勩€傛墍浠ew-shot learning鍜孴ransfer learning绛夊噺灏戞牱鏈拰浜虹被鏍囨敞鐨勬柟娉曞緱鍒版櫘閬嶉噸瑙嗐€侫lphaGo Zero鏄湪鍙屾柟鍗氬紙璁粌杩囩▼涓皾璇曡В鍐冲浜虹被鏍囨敞鏍锋湰鐨勪緷璧栵紝杩欐槸浠ュ線娌℃湁鐨勩€侟/div>
銆€銆€绗笁锛孉lphaGo Zero鍦―NN缃戠粶缁撴瀯涓婂惛鏀朵簡鏈€鏂拌繘灞曪紝閲囩敤浜哛esNet缃戠粶涓殑Residual缁撴瀯浣滀负鍩虹妯″潡銆傝繎鍑犲勾娴佽鐨凴esNet鍔犲ぇ浜嗙綉缁滄繁搴︼紝鑰孏oogLeNet鍔犲ぇ浜嗙綉缁滃搴︺€備箣鍓嶅ぇ閲忚鏂囪〃鏄庯紝ResNet浣跨敤鐨凴esidual缁撴瀯姣擥oogLeNet浣跨敤鐨処nception缁撴瀯鍦ㄨ揪鍒扮浉鍚岄娴嬬簿搴︽潯浠朵笅鐨勮繍琛岄€熷害鏇村揩銆侫lphaGo Zero閲囩敤浜哛esidual搴旇鏈夐€熷害鏂归潰鐨勮€冭檻銆侟/div>
 
 
銆€銆€鏉滃厠澶у鍗氬+鐮旂┒鐢熻阿鐭ラ仴瀵规鍋氫簡杩涗竴姝ラ槓杩帮細
銆€銆€DeepMind鐨勬柊绠楁硶AlphaGo Zero寮€濮嬫憜鑴卞浜虹被鐭ヨ瘑鐨勪緷璧栵細鍦ㄥ涔犲紑濮嬮樁娈垫棤闇€鍏堝涔犱汉绫婚€夋墜鐨勮蛋娉曪紝鍙﹀杈撳叆涓病鏈変簡浜哄伐鎻愬彇鐨勭壒寰併€侟/div>
銆€銆€鍦ㄧ綉缁滅粨鏋勭殑璁捐涓婏紝鏂扮殑绠楁硶涓庝箣鍓嶇殑AlphaGo鏈変袱涓ぇ鐨勫尯鍒€傞鍏堬紝涓庝箣鍓嶅皢璧板瓙绛栫暐(policy)缃戠粶鍜岃儨鐜囧€?value)缃戠粶鍒嗗紑璁粌涓嶅悓锛屾柊鐨勭綉缁滅粨鏋勫彲浠ュ悓鏃惰緭鍑鸿姝ョ殑璧板瓙绛栫暐(policy)鍜屽綋鍓嶆儏褰笅鐨勮儨鐜囧€?value)銆傚疄闄呬笂policy涓巚alue缃戠粶鐩稿綋浜庡叡鐢ㄤ簡涔嬪墠澶ч儴鍒嗙殑鐗瑰緛鎻愬彇灞傦紝杈撳嚭闃舵鐨勬渶鍚庡嚑灞傜粨鏋勪粛鐒舵槸鐩镐簰鐙珛鐨勩€傝缁冪殑鎹熷け鍑芥暟涔熷悓鏃跺寘鍚簡policy鍜寁alue涓ら儴鍒嗐€傝繖鏍风殑鏄剧劧鑳藉鑺傜渷璁粌鏃堕棿锛屾洿閲嶈鐨勬槸娣峰悎鐨刾olicy涓巚alue缃戠粶涔熻鑳介€傚簲鏇村绉嶄笉鍚屾儏鍐点€侟/div>
銆€銆€鍙﹀涓€涓ぇ鐨勫尯鍒湪浜庣壒寰佹彁鍙栧眰閲囩敤浜?0鎴?0涓畫宸ā鍧楋紝姣忎釜妯″潡鍖呭惈2涓嵎绉眰銆備笌涔嬪墠閲囩敤鐨?2灞傚乏鍙崇殑鍗风Н灞傜浉姣旓紝娈嬪樊妯″潡鐨勮繍鐢ㄤ娇缃戠粶娣卞害鑾峰緱浜嗗緢澶х殑鎻愬崌銆侫lphaGo Zero涓嶅啀闇€瑕佷汉宸ユ彁鍙栫殑鐗瑰緛搴旇涔熸槸鐢变簬鏇存繁鐨勭綉缁滆兘鏇存湁鏁堝湴鐩存帴浠庢鐩樹笂鎻愬彇鐗瑰緛銆傛牴鎹枃绔犳彁渚涚殑鏁版嵁锛岃繖涓ょ偣缁撴瀯涓婄殑鏀硅繘瀵规鍔涚殑鎻愬崌璐$尞澶ц嚧鐩哥瓑銆侟/div>
銆€銆€鍥犱负杩欎簺鏀硅繘锛孉lphaGo Zero鐨勮〃鐜板拰璁粌鏁堢巼閮芥湁浜嗗緢澶х殑鎻愬崌锛屼粎閫氳繃4鍧桾PU鍜?2灏忔椂鐨勮缁冨氨鑳藉鑳滆繃涔嬪墠璁粌鐢ㄦ椂鍑犱釜鏈堢殑鍘熺増AlphaGo銆傚湪鏀惧純瀛︿範浜虹被妫嬫墜鐨勮蛋娉曚互鍙婁汉宸ユ彁鍙栫壒寰佷箣鍚庯紝绠楁硶鑳藉鍙栧緱鏇翠紭绉€鐨勮〃鐜帮紝杩欎綋鐜板嚭娣卞害绁炵粡缃戠粶寮哄ぇ鐨勭壒寰佹彁鍙栬兘鍔涗互鍙婂鎵炬洿浼樿В鐨勮兘鍔涖€傛洿閲嶈鐨勬槸锛岄€氳繃鎽嗚劚瀵逛汉绫荤粡楠屽拰杈呭姪鐨勪緷璧栵紝绫讳技鐨勬繁搴﹀己鍖栧涔犵畻娉曟垨璁歌兘鏇村鏄撳湴琚箍娉涘簲鐢ㄥ埌鍏朵粬浜虹被缂轰箯浜嗚В鎴栨槸缂轰箯澶ч噺鏍囨敞鏁版嵁鐨勯鍩熴€侟/div>
銆€銆€杩欎釜宸ヤ綔鎰忎箟浣曞湪鍛紵浜哄伐鏅鸿兘涓撳銆佺編鍥藉寳鍗$綏鑾辩撼澶у澶忔礇鐗瑰垎鏍℃椽闊暀鎺堜篃瀵圭煡绀惧彂琛ㄤ簡鐪嬫硶锛欬/div>
銆€銆€鎴戦潪甯镐粩缁嗕粠澶村埌灏捐浜嗚繖绡囪鏂囥€傞鍏堣鑲畾宸ヤ綔鏈韩鐨勪环鍊笺€備粠鐢ㄦ璋?supervised learning)鍒版墧妫嬭氨锛屾槸閲嶅ぇ璐$尞(contribution)锛佸共鎺変簡褰撳墠鏈€鐗涚殑妫嬫墜锛堝彉韬墠鐨勯樋娉曠嫍锛夛紝鏄痑dvancing state-of-the-art銆傜缁忕綉缁滅殑璁捐鍜岃缁冩柟娉曢兘鏈夋敼杩涳紝鏄垱鏂帮紙novelty锛夈€備粠搴旂敤瑙掑害锛屼互鍚庡彲鑳戒笉鍐嶉渶瑕佽€楄垂浜哄伐鍘讳负AI鐨勪骇鍝佸仛澶ч噺鐨勫墠鏈熷噯澶囧伐浣滐紝杩欐槸鍏舵剰涔?significance)鎵€鍦紒
銆€銆€鎺ョ潃锛屾椽鏁欐巿涔熺畝鍗曞洖椤句簡浜哄伐绁炵粡缃戠粶鐨勫巻鍙诧細
銆€銆€浜哄伐绁炵粡缃戠粶鍦ㄤ笂涓栫邯鍥涘崄骞翠唬灏卞嚭鏉ヤ簡锛屽皬鐏簡涓€涓嬪氨鎾戜笉涓嬪幓浜嗭紝鍏朵腑涓€涓師鍥犳槸澶у鍙戠幇杩欎笢瑗胯В鍐充笉浜嗏€滃紓鎴栭棶棰樷€濓紝鑰屼笖璁粌璧锋潵澶夯鐑︺€傚埌浜嗕笂涓栫邯涓冨崄骞翠唬锛孭aul Werbos璇诲崥鏃跺€欐嬁backpropagation鐨勭畻娉曟潵璁粌绁炵粡缃戠粶锛屾彁楂樹簡鏁堢巼锛岀敤澶氬眰绁炵粡缃戠粶鎶婂紓鎴栭棶棰樿В鍐充簡锛屼篃鎶婄缁忕綉缁滃甫鍏ヤ竴涓柊绾厓銆備笂涓栫邯鍏節鍗佸勾浠o紝浜哄伐绁炵粡缃戠粶鐨勭爺绌惰繋鏉ヤ簡涓€鍦哄ぇ鐏紝瀛︽湳鍦堝彂浜嗘垚鍗冧笂涓囩瘒鍏充簬绁炵粡缃戠粶鐨勮鏂囷紝浠庤璁″埌璁粌鍒颁紭鍖栧啀鍒板悇琛屽悇涓氱殑搴旂敤銆侟/div>
銆€銆€Jim Burke鏁欐巿锛屼竴涓簲骞村墠閫€浼戠殑IEEE Life Fellow锛屾浘缁忚杩囬偅涓勾浠g殑鏁呬簨锛氬幓寮€鐢靛姏绯荤粺鐨勫鏈細璁紝姣忚璁轰竴涓伐绋嬮棶棰橈紝涓嶇鏄暐锛屾€讳細鏈変竴甯汉璇磋繖鍙互鐢ㄧ缁忕綉缁滆В鍐筹紝褰撶劧鏈€鍚庝篃灏变笉浜嗕簡涔嬩簡銆傜畝鍗曠殑璇存槸澶у鎸栧潙鐏屾按鍚规场娉★紝鏈€鍚庢病鍟ュ彲蹇芥偁鐨勪簡锛屽氨鎵句釜鍒殑鍦板効鍐嶇户缁寲鍧戠亴姘村惞娉℃场銆備笂涓栫邯鏈殑瀛︽湳鍦堬紝濡傛灉鍑洪棬涓嶈鑷繁鎼炵缁忕綉缁滅殑閮戒笉濂芥剰鎬濊窡浜烘墦鎷涘懠锛屽氨鍜屽浠婄殑娣卞害瀛︿範銆佸ぇ鏁版嵁鍒嗘瀽涓€鏍枫€侟/div>
銆€銆€鐒跺悗锛屾椽鏁欐巿瀵逛汉宸ユ櫤鑳藉仛浜嗗苟涓嶅崄鍒嗕箰瑙傜殑灞曟湜锛欬/div>
銆€銆€鍥炲埌闃挎硶鐙椾笅妫嬭繖涓簨鍎匡紝浼撮殢鐫€澶ф暟鎹殑娴疆锛屾暟鎹寲鎺樸€佹満鍣ㄥ涔犮€佺缁忕綉缁滃拰浜哄伐鏅鸿兘绐佺劧闂村張鐏簡璧锋潵銆傝繖娆$伀鐨勬湁娌℃湁鏂欏憿锛熸垜璁や负鏄湁鐨勶紝鏈夋捣閲忕殑鏁版嵁銆佹湁璁$畻鑳藉姏鐨勬彁鍗囥€佹湁绠楁硶鐨勬敼杩涖€傝繖灏卞ソ姣斿綋骞存妸backpropagation鐢ㄥ湪绁炵粡缃戠粶涓婏紝鐨勭‘鏄釜绐佺牬銆侟/div>
銆€銆€鏈€缁堣繖涓伀鑳界儳澶氫箙锛岃繕寰楃湅绁炵粡缃戠粶鑳借В鍐冲灏戝疄闄呴棶棰樸€備簩鍗佸勾鍓嶇殑澶х伀涔嬪悗锛岃绁炵粡缃戠粶鈥滆В鍐斥€濈殑瀹為檯闂瀵ュ鏃犲嚑锛屽叾涓竴涓瘮杈冪煡鍚嶇殑鏄數鍔涜礋鑽烽娴嬮棶棰橈紝灏辨槸鐢ㄧ數閲忛娴嬶紝鍒氬ソ鏄垜鐨勪笓涓氥€傜敱浜庡綋骞寸缁忕綉缁滆繃浜庣伀鐖嗭紝瀵艰嚧绉戠爺閲嶅績鍑犱箮瀹屽叏绂诲紑浜嗕紶缁熺殑缁熻鏂规硶銆傜瓑鎴戝垰杩涘叆杩欎釜棰嗗煙鍋氬崥澹鏂囩殑鏃跺€欙紝灏辨嬁浼犵粺鐨勫鍏冨洖褰掓ā鍨嬬鏉€浜嗗競闈笂鐨勫悇绉嶇缁忕綉缁滈仐浼犵畻娉曠殑銆傛垜涓€璐殑鐪嬫硶锛屽浜庣溂鍓嶆祦琛岀殑涓滆タ锛屼笉瑕佺洸鐩拷閫愶紝瑕佸厛瀹℃椂搴﹀娍锛岀湅鐪嬭嚜宸辨搮闀垮暐銆佹湁鍟ョН绱紝鐪嬪噯浜嗗潙鍐嶈烦銆侟/div>
銆€銆€缇庡浗瀵嗘瓏鏍瑰ぇ瀛︿汉宸ユ櫤鑳藉疄楠屽涓讳换Satinder Singh涔熻〃杈句簡鍜屾椽鏁欐巿绫讳技鐨勮鐐癸細杩欏苟闈炰换浣曠粨鏉熺殑寮€濮嬶紝鍥犱负浜哄伐鏅鸿兘鍜屼汉鐢氳嚦鍔ㄧ墿鐩告瘮锛屾墍鐭ユ墍鑳戒緷鐒舵瀬绔湁闄愶細
銆€銆€This is not the beginning of any end because AlphaGo Zero,like all other successful AI so far,is extremely limited in what it knows and in what it can do compared with humans and even other animals.
銆€銆€涓嶈繃锛孲ingh鏁欐巿浠嶇劧瀵归樋娉曞厓澶у姞璧炶祻锛氳繖鏄竴椤归噸澶ф垚灏?鏄剧ず寮哄寲瀛︿範鑰屼笉渚濊禆浜虹殑缁忛獙锛屽彲浠ュ仛鐨勬洿濂斤細
銆€銆€The improvement in training time and computational complex卢ity of AlphaGo Zero relative to AlphaGo,achieved in about a year,is a major achieve卢ment鈥he results suggest that AIs based on reinforcement learning can perform much better than those that rely on human expertise.
銆€銆€闄堟€$劧鏁欐巿鍒欏浜哄伐鏅鸿兘鐨勬湭鏉ュ仛浜嗚繘涓€姝ョ殑鎬濊€冿細
銆€銆€AlphaGo Zero娌℃湁浣跨敤浜虹被鏍囨敞锛屽彧闈犱汉绫荤粰瀹氱殑鍥存瑙勫垯锛屽氨鍙互鎺ㄦ紨鍑洪珮鏄庣殑璧版硶銆傛湁瓒g殑鏄紝鎴戜滑杩樺湪璁烘枃涓湅鍒颁簡AlphaGo Zero鎺屾彙鍥存鐨勮繃绋嬨€傛瘮濡傚浣曢€愭笎瀛︿細涓€浜涘父瑙佺殑瀹氬紡涓庡紑灞€鏂规硶锛屽绗竴鎵嬬偣涓変笁銆傜浉淇¤繖涔熻兘瀵瑰洿妫嬬埍濂借€呯悊瑙lphaGo鐨勪笅妫嬮鏍兼湁鎵€鍚彂銆侟/div>
銆€銆€闄や簡鎶€鏈垱鏂颁箣澶栵紝AlphaGo Zero鍙堜竴娆″紩鍙戜簡涓€涓€煎緱鎵€鏈変汉宸ユ櫤鑳界爺绌惰€呮€濊€冪殑闂:鍦ㄦ湭鏉ュ彂灞曚腑锛屾垜浠┒绔熷簲璇ュ浣曠湅寰呬汉绫荤粡楠岀殑浣滅敤銆傚湪AlphaGo Zero鑷富瀛︿細鐨勮蛋娉曚腑锛屾湁涓€浜涗笌浜虹被璧版硶涓€鑷达紝鍖哄埆涓昏鍦ㄤ腑闂寸浉鎸侀樁娈点€侫lphaGo Zero宸茬粡鍙互缁欎汉绫诲綋鍥存鑰佸笀锛屾寚瀵间汉绫绘€濊€冧箣鍓嶆病瑙佽繃鐨勮蛋娉曪紝鑰屼笉鐢ㄥ畬鍏ㄦ嫎娉ヤ簬鍥存澶у笀鐨勭粡楠屻€備篃灏辨槸璇碅lphaGo Zero鍐嶆鎵撶牬浜嗕汉绫荤粡楠岀殑绁炵鎰燂紝璁╀汉鑴戜腑褰㈡垚鐨勭粡楠屼篃鏄彲浠ヨ鎺㈡祴鍜屽涔犵殑銆侟/div>
銆€銆€闄堟暀鎺堟渶鍚庝篃鎻愬嚭涓€涓湁瓒g殑鍛介锛欬/div>
銆€銆€鏈潵鎴戜滑瑕侀潰瀵圭殑涓€涓寫鎴樺彲鑳藉氨鏄?鍦ㄤ竴浜涗笌鏃ュ父鐢熸椿鏈夊叧鐨勫喅绛栭棶棰樹笂锛屼汉绫荤粡楠屽拰鏈哄櫒缁忛獙鍚屾椂瀛樺湪锛岃€屾満鍣ㄧ粡楠屼笌浜虹被缁忛獙鏈夊緢澶у樊鍒紝鎴戜滑鍙堣濡備綍鍘婚€夋嫨鍜屽埄鐢ㄥ憿锛烖/div>
銆€銆€涓嶈繃David Silver瀵规骞朵笉鎷呭績锛岃€屽鏈潵鍏呮弧淇″績銆備粬鎸囧嚭锛欬/div>
銆€銆€If similar techniques can be applied to other structured problems,such as protein folding,reducing energy consumption or searching for revolutionary new materials,the resulting breakthroughs have the potential to positively impact society.

鐩稿叧鏂伴椈