锘??xml version="1.0" encoding="utf-8" standalone="yes"?>
嬈㈣繋鏉ュ埌Hadoop鍛ㄥ垔鍛ㄤ竴鐗瑰埆鐗堛傛湰鍛ㄦ湁澶ч噺鏉ヨ嚜Spark銆?/span>Kafka銆?/span>Beam銆?/span>Kudu鐨勬妧鏈柊闂匯傚鏋滀綘姝e湪瀵繪壘涓浜涙洿鍓嶆部鐨勬妧鏈紝Apache Metron錛堝鍖栦腑錛夊彂甯冧簡瀹冧滑絎竴涓増鏈?/span>Metron錛屾槸涓涓瀯寤哄湪Hadoop涓婃鍦ㄤ笉鏂彂灞曠殑閫氱敤瀹夊叏緋葷粺銆?/span>
鎶鏈柊闂?/span>
鏈枃浠嬬粛浜嗗浣曞湪AWS涓婃瀯寤烘祦寮忓鐞嗙郴緇熴傚寘鎷簡璇稿Amazon Kinesis 銆?/span>AWS Lambda銆?/span>Kineses S3 connector涔嬬被綆鍗曠殑鎼厤鏂規錛屼篃浠嬬粛浜?/span>AWS瀹炵幇瀹炴椂鍒嗘瀽鍦烘櫙榪欐牱鐩稿澶嶆潅鐐圭殑鏂規銆?/span>
鏈枃浠嬬粛浜嗘庢牱浣跨敤Spark Testing Base銆?/span>Spark Testing Base鏄竴涓敤Scala緙栧啓錛岄氳繃Java璋冪敤鐨?/span>Spark嫻嬭瘯妗嗘灦銆傛湰鏂囩殑鏍蜂緥浠g爜灞曠ず浜嗗浣曢殧紱繪祴璇曢昏緫閲嶆瀯Spark浠g爜錛屽悓鏃惰繕閫氳繃Java澶勭悊浜嗕竴浜涜噧鑲跨殑Scala API銆?/span>
http://www.jesse-anderson.com/2016/04/unit-testing-spark-with-java/
Altiscale鍗氬姒傝堪浜嗗湪Spark鐜涓嬶紝鏋勫緩thin鍜?/span>uber jar鍖呯殑浼樺姡銆傜ず鑼冧簡鍦?/span>Maven鍜?/span>SBT鍒嗗埆鏋勫緩涓ょ鍖呯殑鎯呭喌銆?/span>
https://www.altiscale.com/blog/spark-on-hadoop-thin-jars/
LinkedIn浠嬬粛浜嗕粬浠殑Kafka鐢熸佺郴緇燂紝鐢熸佺郴緇熷寘鍚竴涓壒孌婄殑Kafka producer錛屼竴涓負闈?/span>Java瀹㈡埛绔彁渚涚殑REST API錛屼竴涓?/span>avro妯″紡娉ㄥ唽琛紝浠ュ強Gobblin錛堣杞芥暟鎹埌Hadoop鐨勫伐鍏鳳級絳夌瓑銆?/span>
https://engineering.linkedin.com/blog/2016/04/kafka-ecosystem-at-linkedin
璇?/span>Spark Streaming鏁欑▼浠嬬粛浜嗘庢牱閫氳繃twitter4j API鎷夋帹鏂囷紝鍩轟簬鏍囩榪囨護錛屽鎺ㄦ枃榪涜鎯呮劅鍒嗘瀽銆?/span>
https://www.mapr.com/blog/spark-streaming-and-twitter-sentiment-analysis
Apache Kudu錛堝鍖栦腑錛夋槸Apache Impala錛堝鍖栦腑錛夌殑緇濅匠浼翠荊錛屽洜涓哄畠鑳介珮鏁堝湴瑙e喅騫挎硾鐨勫垎鏋愬拰鏈夐拡瀵規х殑鏌ヨ銆傛湰鏂囨弿榪頒簡涓よ呴泦鎴愮殑鎶鏈粏鑺傦紝渚嬪Kudu鐨勮璁″浣曚繚璇侀珮鏁堝湴鏌ヨ鑳藉姏錛屽浣曢氳繃Impala鍜?/span>Kudu鎵ц鍐欙紡鏇存柊錛忓垹闄ゆ搷浣滅瓑絳夈?/span>
http://blog.cloudera.com/blog/2016/04/how-to-use-impala-and-kudu-together-for-analytic-workloads/
MapR鎾版枃浠嬬粛浜嗕嬌鐢?/span>spark-sklearn鎵╁睍涓涓凡瀛樺湪鐨?/span>scikit-learn妯″瀷銆傛枃绔犱粙緇嶄簡濡備綍閫忚繃Airbnb鏁版嵁闆嗗唴閮ㄥ緩妯★紝榪樹粙緇嶄簡濡備綍鍌嶇潃spark-sklearn榪涜浜ゅ弶楠岃瘉銆?/span>
https://www.mapr.com/blog/predicting-airbnb-listing-prices-scikit-learn-and-apache-spark
AWS澶ф暟鎹崥瀹㈠啓浜嗕釜濡備綍鍦?/span>Amazon EMR涓嬌鐢?/span>HBase鍜?/span>Hive鐨勬暀紼嬨傛湰鏁欑▼浠嬬粛浜?/span>HBase錛屾弿榪頒簡濡備綍鍦?/span>S3涓仮澶?/span>HBase琛紝紺鴻寖浜?/span>Hive鍜?/span>HBase濡備綍闆嗘垚絳夌瓑銆?/span>
鏈枃鎻忚堪浜嗕負瀛︾敓鍦ㄥぇ鏁版嵁璇劇▼涓婃彁渚涘疄鎴樼粡楠岀殑鎸戞垬銆備綔鑰呯粡鍘嗚嫢騫叉鐨勮凱浠e拰閫夋嫨浼間箮鏈変簡涓涓ソ鏂規— Altiscale鐨?/span>Hadoop-as-a-Service銆?/span>
https://www.altiscale.com/blog/hadoop-as-a-service-in-the-classroom/
Cloudera鍗氬鐨勪竴綃囧鍋氭枃绔狅紝浣滆呮瘮杈冧簡Parquet鍜?/span>Avro鍦ㄨ法涓や釜鏁版嵁闆嗙殑涓嶅悓澶勭悊鏂瑰紡錛堜竴涓暟鎹泦紿?/span>(3鍒?/span>)銆佷竴涓暟鎹泦瀹?/span>(103鍒?/span>)錛夈傚湪鐢?/span>Spark鍜?/span>Spark SQL嫻嬭瘯鏌ヨ錛忔搷浣滃悗錛屼綔鑰呭彂鐜?/span>Parquet鍜?/span>Avro鍦ㄦ煡璇㈠簭鍒楀寲鏁版嵁鏂歸潰鏈夋椂琛ㄧ幇寰堢被浼鹼紝灝界鍦ㄥぇ澶氭暟鎯呭喌涓嬫煡璇?/span>Parquet鏁版嵁鐨勬椂鍊欐洿蹇偣錛堝簭鍒楀寲鏁版嵁鏇村皬錛夈?/span>
http://blog.cloudera.com/blog/2016/04/benchmarking-apache-parquet-the-allstate-experience/
鏈枃浠嬬粛浜嗗浣曞湪CDH榪欐牱鐨勫垎甯冨紡鐜涓嬌鐢?/span>SparkR錛屽敖綆?/span>SparkR瀹樻柟榪樻病鏈夋敮鎸佽繖縐嶆柟寮忋傚熷姪YARN鍦?/span>worker鏈湴瀹夎R璇█鍖咃紝job紼嶅姞鏀歸犲氨鑳芥墽琛屼簡銆?/span>
http://www.nodalpoint.com/sparkr-in-cloudera-hadoop/
寰堝寮婧愭鏋墮兘鑳芥墽琛?/span>MapReduce浠ュ強鍊熷姪鏇撮珮綰х殑緙栫▼妯″瀷瀹屾垚綾諱技鐨勫伐浣溿傜旱瑙傝繃鍘伙紝瀹冧滑渚濊禆鐙珛榪愯鐨勬鏋訛紙渚嬪MapReduce, Storm錛夛紝浣嗘槸鏈榪戠殑鏌愪簺鍙樺寲浣垮緱榪欎竴鍒囧厖婊′簡鍙樻暟銆?/span>Apache Beam錛堝鍖栦腑錛夋洿榪涗竴姝ュ湴璺ㄨ秺浜嗘壒澶勭悊銆佹祦寮忓鐞嗕袱縐嶆墽琛屾ā寮忥紝鍐呯疆鏇村姞澶嶆潅鐨勮綆楁ā鍨嬨?/span>
http://www.datanami.com/2016/04/22/apache-beam-emerges-ambitious-goal-unify-big-data-development/
Apache鍗氬鍙戝竷浜?/span>HBase鍦?/span>HDD銆?/span>SSD浠ュ強RAMDISK涓婄殑鍐欏叆鎬ц兘嫻嬭瘯姣斿鐨?/span>7綃囩郴鍒楁枃绔犮傞氳繃榪欎竴鍒嗘瀽錛屼綔鑰呭彂鐜板茍鎻愯鍦?/span>HBase鍜?/span>HDFS涓婂疄鐜頒竴浜涙湭瑕嗙洊鐨勫姛鑳姐?/span>
https://blogs.apache.org/hbase/entry/hdfs_hsm_and_hbase_part
鍏朵粬鏂伴椈
Tom White錛?/span>“Hadoop鏉冨▉鎸囧崡”鐨勪綔鑰呮挵鏂囦粙緇嶄粬鏄浣曟鍏?/span>Apache Hadoop孌垮爞鐨勩備粬鐨勬棭鏈熻礎鐚槸緇曠潃Hadoop涓?/span>Amazon Web Services闆嗘垚灞曞紑錛岃屼粖AWS宸叉垚涓?/span>Hadoop欏圭洰鎴愬姛鐨勯噸瑕侀儴鍒嗐?/span>
http://vision.cloudera.com/how-i-got-into-hadoop/
Fluo錛屼負Apache Accumulo鍑嗗鐨勫垎甯冨紡澶勭悊寮曟搸錛屽悜Apache瀛靛寲鍣ㄦ彁浜や簡瀛靛寲鐢寵銆?/span>
https://wiki.apache.org/incubator/FluoProposal
Apache Phoenix瀹e竷灝嗗湪HBaseCon鍚庝婦琛屼細璁紝Apache Phoenix鏄竴涓?/span>SQL-on-HBase緋葷粺銆傝浼氳鍙湁鍗婂ぉ錛屼富棰樻槸浠嬬粛Phoenix鍐呴儴鎯呭喌鍜岀敤渚嬨?/span>
http://hortonworks.com/blog/announcing-first-annual-phoenixcon-apache-phoenix-user-conference/
浜у搧鍙戝竷
Apache Metron錛屾瀯寤轟簬Hadoop涓婄殑瀹夊叏妗嗘灦錛屽彂甯冧簡0.1鐗堛?/span>Hortonworks鏀拺鍏朵綔涓烘妧鏈瑙堢増錛屽茍鎾板啓鏈枃浠嬬粛浜嗗浣曚笂鎵嬶紝濡備綍璐$尞錛屽浣曚嬌鐢?/span>Metron UI絳夌瓑銆?/span>
http://hortonworks.com/blog/apache-metron-tech-preview-1-come-get/
http://hortonworks.com/blog/apache-metron-use-case-finding-needle-haystack/
Apache NiFi鏈懆鍙戝竷浜?/span>0.6.1鐗堛傝繖鏄慨澶嶄簡10澶氫釜bug鍚庣殑淇鐗堛?/span>
Apache Flink鏈懆鍙戝竷浜?/span>1.0.2鐗堛傛湰嬈″彂甯冨寘鎷簡bug淇錛?/span>RocksDB鐜涓嬬殑鎬ц兘鎻愬崌浠ュ強涓浜涙枃妗f柟闈㈢殑榪涙銆?/span>
http://flink.apache.org/news/2016/04/22/release-1.0.2.html
Amazon鍙戝竷浜嗘柊鐗?/span>Amazon EMR錛屽紑濮嬫敮鎸?/span>HBase 1.2銆?/span>
https://aws.amazon.com/blogs/aws/amazon-emr-update-apache-hbase-1-2-is-now-available/
媧誨姩
涓浗
鏃?/span>
2016騫?/span>4鏈?/span>17鏃?/span>
鍚槑鏄熻景——騫沖彴鍜屽ぇ鏁版嵁鏁翠綋緇勭紪璇?nbsp;
Hortonworks鍦ㄦ湰鍛?/span>Hadoop嬈ф床宄頒細涓婃湁鑻ュ共鐖嗘枡錛岃瘡絀夸簡鏈湡鏁翠釜鍐呭銆備即闅忕潃楠勪漢鐨勬柊鐗規э紝Apache Storm鍙戝竷浜?/span>1.0.0鐗堛傚湪鎶鏈柊闂繪柟闈紝鏈変笉灝戝熀浜?/span>Kafka鏋勫緩澶ц妯℃湇鍔″拰鍒嗗竷寮忕郴緇熸祴璇曠殑鏂囩珷銆傚鏋滀綘閿欒繃浜?/span>Hadoop宄頒細錛岄偅涔堜笉鐢ㄦ媴蹇冿紝婕旇瑙嗛宸茬粡鏀懼埌浜嗙綉涓娿?/span>
鎶鏈柊闂?/span>
Smyte鎾版枃浠嬬粛浜嗕粬浠熀浜庝簨浠舵暟鎹祦瀹炴椂媯嫻嬪瀮鍦鵑偖浠跺拰璇堥獥淇℃伅鐨勫熀紜璁炬柦銆傛渶鍒濈殑浜嬩歡澶勭悊緋葷粺鏋勫緩鍦?/span>Kafka銆?/span>Redis銆?/span>Secor浠ュ強S3涓婏紝涓轟簡婊¤凍瑙勬ā涓嶆柇鎵╁紶鍜屽粔浠風殑瑕佹眰錛屼粬浠妸緋葷粺榪佺Щ鍒板熀浜庣鐩樼殑鏂規涓婏紝浣跨敤Redis鍗忚涓?/span>RocksDB浜や簰錛屼嬌鐢?/span>Kafka榪涜澶嶅埗銆?/span>
https://medium.com/the-smyte-blog/counting-with-domain-specific-databases-73c660472da
鏈枃鎶?/span>rsyslog銆?/span>Kafka銆?/span>AWS 涓?/span>ELK鏍堬紙ElasticSearch銆?/span>Logstash銆?/span>Kibana錛夌粨鍚堬紝澶勭悊璇稿鍙嶅帇銆佽妯′互鍙婄淮鎶ゆ柟闈㈢殑闂銆傛湰鏂囪鐩栦簡rsyslog闆嗘垚Kafka浠ュ強schema鏂歸潰鐨勬妧宸э紝涔熶粙緇嶄簡濡備綍榪愯Kafka銆?/span>Zookeeper浠ュ強AWS涓ぇ瑙勬ā鑷姩鍒嗙粍銆?/span>
https://www.bashton.com/blog/2016/elk-on-ark/
Hortonworks鎾版枃浠嬬粛浜?/span>Apache Atlas浠ュ強Apache Range灝嗚寮曞叆鐨勬暟鎹鐞嗙壒鎬с傝繖浜涚壒鎬ф槸錛氬垎綾昏闂帶鍒躲佹暟鎹湁鏁堟湡絳栫暐銆佷綅緗壒鎬х瓥鐣ャ佺姝㈡暟鎹泦緇勫悎銆佽法緇勪歡瀹舵棌錛堜緥濡備粠Kafka鍒?/span>Storm鍐嶅埌Hive鐨勬暟鎹窡韙級銆?/span>
http://hortonworks.com/blog/the-next-generation-of-hadoop-based-security-data-governance/
Apache HAWQ 錛堝鍖栦腑錛夋槸涓涓熀浜?/span>Greenplum鍦?/span>HDFS涓婃彁渚涙暟鎹煡璇㈢殑SQL寮曟搸銆傛湰鏂囪璁轟簡鍏跺吀鍨嬭璁′互鍙婃柊鐗堟湰鐨勮澶氭敼榪涖傚寘鎷畠涓?/span>Spark鍜?/span>MapReduce鐨勫尯鍒紝榪樻湁浜?/span>Hadoop鎸戞垬緇忓吀MPP璁捐鐨勫唴瀹癸紝浠ュ強HAWQ鐨勬柊璁捐鎬庢牱緇撳悎MPP鍜屾壒澶勭悊鎶鏈繘鑰屼嬌鍏朵袱鑰呭吋欏俱?/span>
Cloudera鍗氬鎾版枃浠嬬粛浜嗗Hadoop鍒嗗竷寮忕郴緇熻繘琛屾晠闅滄敞鍏ャ佺粍緗戠殑嫻嬭瘯宸ュ叿AgenTEST銆傚畠鑳芥敞鍏ョ綉緇滄晠闅滐紙渚嬪涓㈠寘錛夛紝璧勬簮婊¤澆錛堜緥濡?/span>CPU銆?/span>IO銆佺鐩樼┖闂達級絳夌瓑銆傚綋嫻嬭瘯緗戠粶鍒嗗尯鏃訛紝鍙互璇勪及鐜艦緇勭綉銆佹ˉ鎺ョ粍緗戠瓑絳夈?/span>
Hortonworks鍗氬灞曟湜浜嗗皢鍖呭惈鏂扮増鏈?/span>Spark鍜?/span>Zeppelin鐨?/span>HDP 2.4.2銆?/span>Spark2.0棰勮鐗堝拰Zeppelin鏂扮壒鎬ч兘灝嗗寘鍚湪鍐呫?/span>
http://hortonworks.com/blog/apache-spark-apache-zeppelin-whats-coming-in-hdp-2-4-2/
Cask鎾版枃浠嬬粛浜嗗湪Hbase region compaction榪欐牱緗曡浜嬩歡鍙戠敓鐨勫墠鍚庯紝浠栦滑鏄庢牱閫氳繃闀挎椂闂存祴璇曚互璇勪及鍒嗗竷寮忕郴緇熸紜х殑銆?/span>
http://blog.cask.co/2016/04/long-running-tests-in-cdap/
鏈枃浠嬬粛浜嗗浣曠粨鍚?/span>SparkR涓庝簹椹?/span>EMR榪涜鍦扮悊絀洪棿鍒嗘瀽鐨勩傞氳繃SparkR鐨?/span>Hive闆嗘垚緇勪歡錛屽彲浠ョ珛鍒誨熀浜?/span>S3涓婄殑鏁版嵁鏄犲皠Hive澶栭儴琛ㄣ備粠榪欏紑濮嬶紝鏁版嵁灝辮兘鐩存帴鍔犺澆鍒板唴瀛樹腑浣跨敤R璇█鍒嗘瀽錛屽緢瀹規槗瀹炵幇楂樿川閲忕殑鏁版嵁鍙鍖栥?/span>
MapR緙栧啓浜嗕嬌鐢?/span>Pig鍜?/span>Hive鍒嗘瀽鑱屼笟媯掔悆澶ц仈鐩熺悆闃熸按騫崇殑鏁欑▼銆?/span>Pig鐢ㄤ簬鏁版嵁鍒濆姞宸ワ紝Hive鎻愪緵鍩轟簬SQL鐨勬暟鎹煡璇㈢幆澧冦傚熷姪Hive ODBC椹卞姩鍜?/span>Hive鏈嶅姟鍣紝浣垮緱寰蔣Excel涔熻兘鐢ㄤ簬鑾峰彇鍜屽垎鏋愭暟鎹?/span>
https://www.mapr.com/blog/using-hive-and-pig-baseball-statistics
SignalFX閫氳繃27鑺傜偣鐨?/span>Kafka闆嗙兢姣忓ぉ澶勭悊700澶氫嚎鏉℃秷鎭傚彧鏈夊熀浜庝粬浠Н绱殑澶ц妯?/span>Kafka浣跨敤緇忛獙鎵嶈兘鏈夊姝ら珮鐨勯噺錛屽洜姝や粬浠叡浜簡涓嶅皯璋冭瘯Kafka鐨勬妧宸э紝瀹氫綅鍛婅錛堜緥濡傛棩蹇楀埛鏂板歡榪熷鍔狅級錛屼互鍙?/span>Kafka妯悜鎵╁睍銆?/span>
http://www.confluent.io/blog/how-we-monitor-and-run-kafka-at-scale-signalfx
dataArtisan's鍗氬涓轟簡搴﹂噺Flink鍦ㄦ暟鎹祦鏁堢巼銆佷綆寤惰繜銆佹紜т笂鐨勮兘鍔涳紝涓撻棬鍐欎簡榪欑瘒鏂囩珷銆備負浜嗚瘉鏄庢晥鐜囷紝鍦ㄩ珮鍚炲悙閲忕殑鐜涓嬭繍琛屼簡鏈鏂扮殑Yahoo!嫻佸紡鍩哄噯嫻嬭瘯紼嬪簭銆傚湪姝g‘鎬ф柟闈紝鏂囩珷紿佸嚭浜?/span>Flink浜嬩歡鍒ゅ埆鍜屽鐞嗕簨浠訛紙鏄熺悆澶ф垬鐢靛獎騫磋〃鍋氱被姣旓級鏂歸潰鐨勪紭鍔褲傛渶鍚庯紝鏂囩珷鎻忚堪浜?/span>Flink鏈潵鐗堟湰鍩轟簬鍐呭瓨鐨勬煡璇換鍔°?/span>
http://data-artisans.com/counting-in-streams-a-hierarchy-of-needs/
鏈暀紼嬩粙緇嶄簡鎬庢牱鎶?/span>TCP Socket涓殑鏂囨湰鏁版嵁嫻佽漿鎹負Spark嫻佸紡鏁版嵁婧愩?/span>
https://medium.com/@anicolaspp/spark-custom-streaming-sources-e7d52da72e80
鏈枃浠嬬粛浜嗗湪鏋勫緩Hadoop鐨勬椂鍊欐庢牱闃叉AWS璇佷功鎰忓鎻愪氦鍒拌ˉ涓佹垨git璧勬簮搴撱傞櫎Hadoop鏈韓澶栵紝鏈枃榪樺緩璁嬌鐢?/span>“git-secrets”宸ュ叿闃叉鎰忓鎻愪氦璁塊棶/瀹夊叏瀵嗛挜銆傚鏋滀綘鐢ㄧ殑鏄?/span>Hadoop S3錛岃繕鎺ㄨ崘浜嗘柊琛ヤ竵渚涜瘎浼般?/span>
http://steveloughran.blogspot.co.uk/2016/04/testing-against-s3-and-object-stores.html
Big Data & Brews閲囪浜?/span>MapR鐨?/span>Ted Dunning鍜?/span>Jacques Nadeau銆?/span>Apache Arrow涔熷湪鏈閲囪鑼冨洿鍐呫?/span>
https://www.youtube.com/watch?v=l3mDDKjDjMk
https://www.youtube.com/watch?v=Xo9CO0a0VJI
鍏朵粬鏂伴椈
DataEngConf鏈榪戝湪鏃ч噾灞卞彫寮銆傛湰鏂囨葷粨浜?/span>Uber銆?/span>Stripe銆?/span>Microsoft銆?/span>Instacart銆?/span>Jawbone鐨勫彂璦鍐呭銆備篃浠嬬粛浜嗕細璁富棰?/span>“鏁版嵁縐戝鍦ㄧ幇瀹炰笘鐣屼腑鏄竴涓駭鍝佸拰宸ョ▼瀛︾”銆?/span>
Hortonworks鍦ㄤ笂鍛ㄩ兘鏌忔灄涓捐鐨?/span>Hadoop嬈ф床宄頒細涓婂ぇ鏀懼紓褰┿?/span>ZDNet鎶ュ浜嗚繖浜涗寒鐐癸紝鍏朵腑鍖呮嫭涓?/span>Pivotal錛堝凡杞敭緇?/span>HDP錛夌殑鎵╁睍鍚堜綔錛屼笌Syncosrt鐨勮漿鍞崗璁紝浠ュ強Atlas銆?/span>Ranger銆?/span>Zeppelin銆?/span>Metron鐨勬妧鏈瑙堛傛姤瀵艱繕浠嬬粛浜?/span>Hortonworks銆?/span>Cloudera銆?/span>MapR浜у搧鐨勪笉鍚屼箣澶勩?/span>
Flink 2016宄頒細灝嗗湪涔濇湀浜庡痙鍥芥煆鏋椾婦琛屻傝璁鴻棰樺緛闆嗗皢浜庡叚鏈堟湯緇撴潫銆?/span>
http://flink.apache.org/news/2016/04/14/flink-forward-announce.html
YouTube涓婂彂甯冧簡Hadoop閮芥煆鏋楀嘲浼氭紨璁茶棰戙傛濡傞鏈熺殑閭f牱錛岃繖浜涙紨璁插唴瀹規兜鐩?/span>Hadoop鐢熸佺郴緇熺殑鍚勪釜閮ㄥ垎銆?/span>
浜у搧鍙戝竷
Metascope鏄竴涓厤鍚?/span>Schedoscope鍦?/span>Hadoop闆嗙兢涓繘琛屽厓鏁版嵁綆$悊鐨勬柊宸ュ叿銆傞氳繃web鐣岄潰錛屽埄鐢ㄦ暟鎹部琚畠鑳芥礊瀵熷ぇ閲忕殑鏁版嵁銆備篃鎻愪緵媯绱€佸唴宓屾枃妗c?/span>REST API絳夌瓑鍔熻兘銆?/span>
https://github.com/ottogroup/metascope
Apache HBase 1.2.1浜庢湰鍛ㄥ彂甯冿紝鍦?/span>1.2.0鐨勫熀紜涓婅В鍐充簡27涓棶棰樸傚彂甯冨0鏄庝腑閲嶇偣浠嬬粛浜嗗洓涓珮浼樺厛綰х殑闂銆?/span>
Apache Mahout鏈哄櫒瀛︿範搴撳彂甯冧簡0.12.0鐗堛傝鐗堟湰鐨?/span>“Samsara”鏁板鐜寮濮嬫敮鎸?/span>Apache Flink浜嗭紝騫朵笖鏄鉤鍙版棤鍏崇殑銆傚彂甯冨0鏄庝腑鍒嗕韓浜嗕笌Flink闆嗘垚銆佸凡鐭ラ棶棰樸侀」鐩紨榪涜鍒掔浉鍏崇殑鍐呭銆?/span>
Apache Storm 1.0.0鏈懆鍙戝竷浜嗐備寒鐐瑰寘鎷ц兘鎻愬崌錛堟櫘閬嶆彁鍗?/span>3鍊嶄互涓婏級銆佹柊鐨勫垎甯冨紡緙撳瓨API銆?/span>nimbus鐨勯珮鍙敤鎬с佽嚜鍔ㄥ弽鍘嬨佸姩鎬?/span>worker鎬ц兘鍒嗘瀽絳夌瓑銆?/span>
http://storm.apache.org/2016/04/12/storm100-released.html
Apache Kudu錛堝鍖栦腑錛夋湰鍛ㄥ彂甯冧簡0.8.0鐗堛傛湰嬈″彂甯冩坊鍔犱簡Apache Flume sink銆侀儴鍒嗗姛鑳芥彁鍗囥佷慨澶嶄簡涓鎵?/span>bug銆?/span>
http://getkudu.io/releases/0.8.0/docs/release_notes.html
Cloudbreak鏈懆鍙戝竷浜?/span>1.2鐗堬紝瀹冧負浜戠幆澧冩彁渚?/span>Hadoop闆嗙兢Docker銆傛柊鐗規у寘鎷敮鎸?/span>OpenStack浠ュ強涓鴻嚜瀹氫箟鏈嶅姟鍣ㄦ彁渚涢厤緗剼鏈?/span>
http://hortonworks.com/blog/announcing-cloudbreak-1-2/
Cloudera鍙戝竷浜?/span>Cloudera Enterprise 5.4.10錛屽唴緗簡Flume銆?/span>Hadoop銆?/span>HBase銆?/span>Hive銆?/span>Impala絳夌粍浠躲?/span>
Presto Accumulo鏄釜鏂伴」鐩紝涓?/span>Accumulo璇誨啓鏁版嵁鎻愪緵浜?/span>Presto榪炴帴鍣ㄣ?/span>
https://github.com/bloomberg/presto-accumulo
媧誨姩
涓浗
鏃?/span>
絎?165 鏈?2016騫?鏈?0鏃?
鍚槑鏄熻景——騫沖彴鍜屽ぇ鏁版嵁鏁翠綋緇勭紪璇?/strong>
鏈懆錛屽寘鎷?/span>LinkedIn 鍜?/span>Airbnb鏂板紑婧愰」鐩湪鍐呯殑鏁頒釜浜у搧榪涜浜嗛噸澶х増鏈彂甯冦傛湰鏈熸妧鏈儴鍒嗕笌嫻佸紡澶勭悊鏈夊叧——Spark銆?/span>Flink銆?/span>Kafka絳夌瓑錛涙柊闂婚儴鍒嗘槸鍏充簬Spark Summit 鍜?/span>HbaseCon鐨勪細璁紼嬨?/span>
Zalando鍙戣〃浜嗕粬浠槸濡備綍閫夋嫨Apache Flink浣滀負嫻佸紡澶勭悊妗嗘灦鐨勬枃绔犮傝鏂囩珷闃愯堪浜嗗璇勪環鏍囧噯榪涜楠岃瘉鍚庡緱鍑虹殑緇撹錛岄槓鏄庝簡閫夋嫨Apache Flink鐨勪富鍥?/span>—鍦ㄩ珮鍚炲悙閲忕殑鎯呭喌涓嬩緷鐒惰兘淇濇寔浣庡歡榪燂紝鐪熸鐨勬祦寮忓鐞嗭紝寮鍙戜漢鍛樻敮鎸併?/span>
https://tech.zalando.com/blog/apache-showdown-flink-vs.-spark/
Cloudera鍗氬鍒婄櫥浜嗘潵鑷?/span>Wargaming.net鐨勬枃绔狅紝閫氳繃鏈枃鍙簡瑙e埌浠栦滑濡備綍閫氳繃Kafka銆?/span>HBase銆?/span>Drools銆?/span>Spark鏋勫緩瀹炴椂澶勭悊鍩虹璁炬柦鐨勩傚彟澶栵紝鍦ㄦ暟鎹祦紼嬫柟闈紝浠栦滑浠嬬粛浜嗗浣曞HBase鐨勬绱㈠拰搴忓垪鍖栥?/span>HBase鍜?/span>Spark涔嬮棿鐨勬暟鎹湰鍦板寲浠ュ強Spark璁$畻鏂歸潰鐨勪紭鍖栨帾鏂姐?/span>
http://blog.cloudera.com/blog/2016/04/inside-wargamings-data-driven-real-time-rules-engine/
InfoQ鍙戝竷浜嗗ぇ瑙勬ā嫻佸紡澶勭悊—SMACK錛?/span>Spark銆?/span>Mesos銆?/span>Akka銆?/span>Cassandra浠ュ強 Kafka錛夋爤鐨勪粙緇嶈棰戙傝璁轟簡涓轟粈涔?/span>SMACK鏍堝湪澶勭悊鍚屾牱闂鐨勬椂鍊欐瘮Lambda鏋舵瀯鏇寸畝鍗曘?/span>
http://www.infoq.com/presentations/stream-analytics-scalability
Confluent“鏃ュ織鍘嬬緝”緋誨垪鍗氭枃鍙堟湁鏇存柊錛屼粙緇嶄簡Kafka欏圭洰涓夋湀浠藉彂鐢熺殑浜嬫儏銆傛湁涓嶅皯浠や漢鍏蟲敞鐨勫紑鍙戝唴瀹癸紝鍖呮嫭鏈烘灦鎰熺煡銆?/span>Kerberos鏀寔銆佸熀浜庢椂闂寸儲寮曟柟闈㈢殑榪涘睍銆備互鍙婁笉灝戜綘錛堟垜涔熸槸錛夋病鏈夋椂闂存寔緇叧娉ㄧ殑鏈鏂扮爺鍙戞垚鏋溿?/span>
Apache Flink 1.0寮曞叆浜嗘柊鐨勫鏉備簨浠跺鐞嗭紙CEP錛夊簱銆傚暟鍡﹀嚑鍙ワ紝CEP鎻愪緵浜嗕竴縐嶆嫻嬩簨浠舵ā寮忕殑鏂規硶銆傛湰鏂囧熷姪浼犳劅鍣ㄤ粠鏁版嵁涓績鏈嶅姟鍣ㄤ笂鏀墮泦鏁版嵁錛岃繍鐢ㄤ竴縐嶅彲鑳界殑寮傚父媯嫻嬬敤渚嬶紝璇犻噴浜?/span>Flink鐨?/span>CEP妯″紡API 銆?/span>
http://flink.apache.org/news/2016/04/06/cep-monitoring.html
Genome Analysis Toolkit 錛?/span>GATK錛夋渶榪戝甯冿紝涓嬩竴涓増鏈紙褰撳墠鏄?/span>alpha錛夊皢鏀寔Apache Spark銆傛湰鏂囩畝瑕佷粙緇嶄簡宸ュ叿綆卞茍灞曠ず浜嗘庢牱閫氳繃Spark鏉ユ嫻嬮噸澶?/span>DNA鐗囨鐨勩?/span>
InfoWorld緇艱堪浜?/span>Spark2.0鍏充簬緇撴瀯鍖栨祦寮忓鐞嗘柟闈㈢殑璁″垝銆傚井鎵瑰鐞嗗皢渚濈劧寤剁畫錛岃繕鏈変簺鏂扮壒鎬э紝渚嬪鏃犻檺鏁版嵁甯э紙Infinite DataFrames錛夈佷竴嫻佺殑閲嶅鏌ヨ鏀寔銆?/span>
AWS澶ф暟鎹崥瀹㈠彂甯冧簡涓綃囬氳繃瀛樺偍鍦?/span>AWS Key Management Service 錛?/span>KMS錛変腑鐨勫姞瀵嗗瘑閽ュ姞杞芥暟鎹埌S3鍜?/span>Redshift鐨勬枃绔犮傞櫎浜嗘弿榪版墍闇姝ラ錛屾湰鏂囪繕浠嬬粛浜嗗浣曞湪AWS S3涓氳繃KMS瀵嗛挜鍔犲瘑鏁版嵁銆?/span>
Confluent鍗氬浠嬬粛浜嗗浣曚嬌鐢?/span>Kafka Connect 鍜?/span> Kafka Streams 緙栧啓闈炲嚒鐨?/span>“hello world”紼嬪簭銆傛洿紜垏鍦拌錛岃寖渚嬬▼搴忎粠IRC鎷夌淮鍩虹櫨縐戞暟鎹紝騫惰В鏋愭秷鎭佽繘琛屽鏂歸潰鐨勭粺璁¤綆椼傛湰鏂囪繕鐢ㄤ簡鑻ュ共紼嬪簭灞曠ず浜嗘暣涓疄鐜拌繃紼嬨?/span>
http://www.confluent.io/blog/hello-world-kafka-connect-kafka-streams
鏈枃浠?/span>Postgres 鍚?/span> Cassandra杞崲綆鍗曠殑妯″紡錛?/span>schemas錛夛紝騫舵弿榪頒簡涓昏鐨勫樊寮?/span>—澶嶅埗銆佹暟鎹被鍨嬶紙Cassandra涓嶆敮鎸?/span>JSON錛夈佷富閿佹渶緇堜互涓鑷存с?/span>
http://neovintage.org/2016/04/07/data-modeling-in-cassandra-from-a-postgres-perspective/
ESG鍗氬鎶ュ浜嗘渶榪?/span>Strata+Hadoop World澶т細鐨勬儏鍐點傚茍鏈変簺閲嶇偣鍏蟲敞錛屼緥濡?/span>Spark鐨勮壇濂藉娍澶淬佹満鍣ㄥ涔犮佷簯鏈嶅姟銆?/span>
http://blog.esg-global.com/riding-high-at-stratahadoop-world
InformationWeek涔熸姤瀵間簡Strata澶т細錛屽叧娉ㄤ簡MapR鍜?/span>Pivotal鐨勫叧鐏墖銆佷漢宸ユ櫤鑳界瓑銆?/span>
Spark Summit 2016璁▼鏁插畾錛屽皢浜?/span>6鏈?/span>6-8鏃ュ湪鏃ч噾灞變婦琛屻備細璁皢鏈変袱澶╁睍寮浜斾釜鏂瑰悜鐨勮璁恒?/span>
https://databricks.com/blog/2016/04/04/agenda-announced-for-sparksummit-2016-in-san-francisco.html
紱忓竷鏂噰璁夸簡Cloudera CEO Tom Reilly錛屼粬璁ㄨ浜嗗叕鍙哥殑鏈洪亣銆佺珵浜夋у競鍦恒佷笂甯傝鍒掔瓑銆?/span>
Datanami鎾版枃灝嗘鍦ㄥ礇璧風殑Apache Kafka浣滀負嫻佸紡澶勭悊鐨勬敮鏌便傛枃绔犺繕閲囪浜?/span>Confluent鑱斿悎鍒涘浜哄吋CTO Neha Narkhede錛屽潑闂村ス琛ㄧず鏈榪戝皢鎺ㄥ嚭Kafka Connect 鍜?/span> Kafka Streams銆?/span>
http://www.datanami.com/2016/04/06/real-time-rise-apache-kafka/
HBaseCon灝嗕簬5鏈?/span>24鏃ュ湪鏃ч噾灞卞彫寮錛屾渶榪戣紼嬫墠姝e紡瀹e竷銆傚湪涓変釜鏂瑰悜涓婏紝灝嗘湁20涓互涓婄殑璁瑕佽璁恒?/span>
http://blog.cloudera.com/blog/2016/04/hbasecon-2016-speaker-lineup-announced/
Apache HBase 0.98.18 鍜?/span>1.1.4鏈榪戦兘鍙戝竷浜嗐?/span>1.1.4涓婃湁鍖呮嫭涔濅釜鎴栨紜у湪鍐呯殑鑻ュ共淇銆?/span>HBase 0.98.18緹炵瓟絳旂殑浠呰В鍐充簡50涓棶棰橈紙bug銆佹敼鍠勪袱涓柊鐗規э級銆?/span>
http://mail-archives.apache.org/mod_mbox/hbase-user/201603.mbox/%3CCANZa%3DGu-mAxKEtfoRjctHcE0KD7z52oE010Fgsf6AMmW2tDZLA%40mail.gmail.com%3E
http://mail-archives.apache.org/mod_mbox/hbase-user/201603.mbox/%3CCA%2BRK%3D_CtZ1L07nS6Og2ekfVwet0qTE7jw-bmyD2pp5UPweUehQ%40mail.gmail.com%3E
Apache Lens鍙戝竷浜?/span>2.5.0-beta錛屼綔涓虹粺涓鍒嗘瀽鎺ュ彛錛屽畠宸茬粡鏀寔Hadoop鐢熸佺郴緇熺殑鎵ц寮曟搸鏁版嵁瀛樺偍浜嗐傛湰嬈″彂甯冭В鍐充簡87紲紝涓昏鏄?/span>bug淇鍜屽疄鐜版柊鍔熻兘銆?/span>
Airbnb 寮婧愪簡 Caravel錛屾暟鎹帰绱㈢郴緇燂紙鏁版嵁鍙鍖栧鉤鍙幫級銆?/span>Caravel鏀寔澶氱鍦ㄥ晢涓氫駭鍝佷笂鎵嶈兘鐪嬪埌鐨勭壒鎬э紝鑳藉榪炴帴鍒頒換鎰忓彧瑕佹敮鎸?/span>SQL鏂硅█鐨勭郴緇熴傚挨鍏跺畠鏀寔闈㈠悜Druid鐨勫疄鏃跺垎鏋愩?/span>
https://medium.com/airbnb-engineering/caravel-airbnb-s-data-exploration-platform-15a72aa610e5
MapR 瀹e竷鏀寔Apache Drill 1.6浣滀負浠栦滑鐨勫垎甯冨紡緋葷粺銆傛瘮杈冩湁浜偣鐨勫彂甯冩湁MapR-DB鏂板瓨鍌ㄦ彃浠躲佹柊SQL紿楀彛鍑芥暟鏀寔浠ュ強绔绔畨鍏ㄣ傚湪緗戦〉浠嬬粛閮ㄥ垎錛屾湁浜涗嬌鐢?/span>MapR-DB API鍔?/span>杞?/span>鏁版嵁騫墮?/span>榪?/span>Drill鏌ヨ鐨勪緥瀛愩?/span>
Apache Flink鍙戝竷浜嗕慨澶?/span>bug鍚庣殑1.0.x銆傝繖嬈″彂甯冭В鍐充簡23涓棶棰橈紝鎺ㄨ崘鎵鏈?/span>1.0.0鐨勭敤鎴峰崌綰с?/span>
http://flink.apache.org/news/2016/04/06/release-1.0.1.html
Cloudera Enterprise 5.7鍙戝竷闄勫甫浜?/span>Spark銆?/span>HBase銆?/span>Impala銆?/span>Kafka絳夌粍浠剁増鏈殑鍗囩駭銆傛湰嬈″彂甯冪殑浜偣鍖呮嫭浠?/span>Cloudera Labs 鏂伴矞鎺ㄨ崘鐨?/span>Hive-on-Spark銆?/span>HBase-Spark銆?/span>Impala鎬ц兘閲嶈鎻愬崌錛屾敮鎸?/span>SSD 涓?/span>HBase WAL銆?/span>
http://blog.cloudera.com/blog/2016/04/cloudera-enterprise-5-7-is-released/
Apache Tajo錛屾瀯寤哄湪Hadoop涓婄殑鏁版嵁浠撳簱緋葷粺錛屽彂甯冧簡0.11.2鐗堛傛柊鐗堟湰鏀寔浜?/span>Kerberos錛屼慨澶嶄簡ORC琛ㄥHive鐨勬敮鎸佺瓑銆?/span>
http://tajo.apache.org/releases/0.11.2/announcement.html
LinkedIn 寮婧愪簡 Dr. Elephant錛岄噷闈㈢殑宸ュ叿鑳借瘖鏂?/span>Hadoop鍜?/span>Spark浠誨姟鐨勬ц兘闂銆傚熀浜?/span>metrics浠?/span>YARN璧勬簮綆$悊鍣ㄦ敹闆嗗凡瀹屾垚浠誨姟鏁版嵁錛?/span>Dr. Elephant璇勪及鍚庣敓鎴愯瘖鏂姤琛紝鍐呭鍖呮嫭鏁版嵁閿欎綅銆?/span>GC寮閿絳夈?/span>LinkedIn瀹gО鍊熷姪瀹冭兘瑙e喅80%鐨勯棶棰樸?/span>
涓浗
鏃?/span>