??xml version="1.0" encoding="utf-8" standalone="yes"?>久久亚洲精品国产亚洲老地址,麻豆亚洲AV成人无码久久精品 ,国产亚洲一区二区三区在线观看http://m.tkk7.com/rosen/zh-cnFri, 09 May 2025 04:57:23 GMTFri, 09 May 2025 04:57:23 GMT60Hadoop周刊—第 176 ?/title><link>http://m.tkk7.com/rosen/archive/2016/07/12/431174.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Tue, 12 Jul 2016 13:21:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2016/07/12/431174.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/431174.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2016/07/12/431174.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/431174.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/431174.html</trackback:ping><description><![CDATA[<p align="left" style="line-height: 10%;"><strong> </strong></p> <p align="left" style="line-height: 10%;"><strong><span style="font-size:16.0pt;line-height:10%">Hadoop</span></strong><strong><span style="font-size:16.0pt;line-height:10%;font-family:宋体;">周刊</span></strong><strong> </strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">W?/span></strong><strong><span style="font-size:16.0pt;line-height:10%"> 176 </span></strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">?/span></strong><strong></strong></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">启明星辰q_和大数据Ml编?/span></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%">2016</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">q?/span><span style="font-size:14.0pt;line-height:10%">6</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span><span style="font-size:14.0pt;line-height:10%">29</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span></p> <p> </p> <p><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">C本周在圣何塞召开Q所以很期待在下期周刊看到新目的发布和_ֽ演讲Q请向我们提供Q何相关的qȝ片)。至于本期周刊,有大量关?/span><span style="font-family:Helvetica;">Kafka Streams</span><span style="font-family:宋体;">、从</span><span style="font-family:Helvetica;">Amazon Kinesis</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Google BigQuery</span><span style="font-family:宋体;">传递流式数据?/span><span style="font-family:Helvetica;">Google</span><span style="font-family:宋体;">数据集搜索系l的文章?/span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">技术新?/span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">Shine</span><span style="font-family:宋体;">介绍了他们如何?/span><span style="font-family:Helvetica;">Amazon Lambda</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Amazon Kinesis</span><span style="font-family:宋体;">Q以及ؓ</span><span style="font-family: Helvetica;">Apache web</span><span style="font-family: 宋体;">服务器提供的</span><span style="font-family: Helvetica;">Kinesis</span><span style="font-family: 宋体;">代理Q用于采日志Q?/span><span style="font-family:宋体;">Q以及从</span><span style="font-family:Helvetica;">EC2</span><span style="font-family:宋体;">Ud数据?/span><span style="font-family:Helvetica;">Google BigQuery</span><span style="font-family: 宋体;">的内宏V本文提供了</span><span style="font-family:Helvetica;">Lambda</span><span style="font-family:宋体;">函数Q?/span><span style="font-family:Helvetica;">javascript</span><span style="font-family:宋体;">~写Q代码片D,规模和开销斚w的信息,描述了如何通过</span><span style="font-family:Helvetica;">gzip</span><span style="font-family:宋体;">压羃数据从而优化传输开销?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://blog.shinetech.com/2016/06/21/kinesis-lambda-bigquery/</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">Cloudera</span><span style="font-family:宋体;">博客撰文介绍了如何通过</span><span style="font-family: Helvetica;">Apache Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Impala</span><span style="font-family:宋体;">Q孵化中Q?/span><span style="font-family:Helvetica;">Hue</span><span style="font-family:宋体;">Ҏ之队数据q行分析。本文主要聚焦在分析上,附带了些</span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">代码以及</span><span style="font-family:Helvetica;">Hue</span><span style="font-family:宋体;">的功能演C?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blog.cloudera.com/blog/2016/06/how-to-analyze-fantasy-sports-with-apache-spark-and-sql-part-2-data-exploration/</span></p> <p> </p> <p><span style="font-family:Helvetica;">KDnuggets</span><span style="font-family:宋体;">撰文介绍?/span><span style="font-family:Helvetica;">13</span><span style="font-family:宋体;">个和</span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family:宋体;">相关的主?/span><span style="font-family:Helvetica;">API/</span><span style="font-family:宋体;">目</span><span style="font-family:Helvetica;">/</span><span style="font-family:宋体;">名词。包?/span><span style="font-family:Helvetica;">RDD</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">DataFrame</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Dataset</span><span style="font-family:宋体;">、结构化式计算?/span><span style="font-family:Helvetica;">GraphX</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Tungsten</span><span style="font-family:宋体;">。每个条目都有一D늫节介l,_很好的了?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">主要Ҏ了?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.kdnuggets.com/2016/06/spark-key-terms-explained.html</span></p> <p> </p> <p><span style="font-family:宋体;">本文来自</span><span style="font-family:Helvetica;">Confluent</span><span style="font-family:宋体;">博客Q介l了那些虽看h单却又不单的</span><span style="font-family:Helvetica;">Kafka Streams</span><span style="font-family:宋体;">应用。例如用</span><span style="font-family:Helvetica;">Kafka Streams</span><span style="font-family:宋体;">~写l合用户点击数据和用户位置数据的程序。后者存储在</span><span style="font-family:Helvetica;">KTable</span><span style="font-family:宋体;">中,</span><span style="font-family:Helvetica;">KTable</span><span style="font-family:宋体;">提供了类似带有数据库表主键的抽象Q主键的最新值通过</span><span style="font-family:Helvetica;">API</span><span style="font-family:宋体;">暴露Q。最后的E序倒是?/span><span style="font-family:Helvetica;">——</span><span style="font-family:宋体;">只有几行代码?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.confluent.io/blog/distributed-real-time-joins-and-aggregations-on-user-activity-events-using-kafka-streams</span></p> <p> </p> <p><span style="font-family:Helvetica;">Cloudera</span><span style="font-family:宋体;">博客撰文介绍?/span><span style="font-family:Helvetica;">meinstadt.de</span><span style="font-family:宋体;">构徏?/span><span style="font-family:Helvetica;">Apache Flume</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Spark Streaming</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Impala</span><span style="font-family:宋体;">Q孵化中Q上?/span><span style="font-family:Helvetica;">HTTP</span><span style="font-family:宋体;">h异常系l。实C码放在了</span><span style="font-family: Helvetica;">github</span><span style="font-family:宋体;">上?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blog.cloudera.com/blog/2016/06/how-to-detect-and-report-web-traffic-anomalies-in-near-real-time/</span></p> <p> </p> <p><span style="font-family:Helvetica;">AWS</span><span style="font-family:宋体;">大数据博客有教程介绍了如何?/span><span style="font-family: Helvetica;">Apache Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Zeppelin</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Amazon EMR</span><span style="font-family:宋体;">集群处理</span><span style="font-family:Helvetica;">Amazon Kinesis</span><span style="font-family:宋体;">数据。本文包含了一些通过</span><span style="font-family:Helvetica;">Zeppelin notebook</span><span style="font-family:宋体;">q行</span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">产生的数据可视化范例?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blogs.aws.amazon.com/bigdata/post/Tx3K805CZ8WFBRP/Analyze-Realtime-Data-from-Amazon-Kinesis-Streams-Using-Zeppelin-and-Spark-Strea</span></p> <p> </p> <p><span style="font-family:Helvetica;">Apache Kudu</span><span style="font-family: 宋体;">Q孵化中Q接q?/span><span style="font-family:Helvetica;">1.0</span><span style="font-family:宋体;">版发布了Q将全面支持高可用性。本文介l了q最后一块拼?/span><span style="font-family:Helvetica;">“</span><span style="font-family:宋体;">d?/span><span style="font-family:Helvetica;">”</span><span style="font-family:宋体;">是如何实现的。晒了下</span><span style="font-family:Helvetica;">JIRA</span><span style="font-family:宋体;">上各U问题的跟进的情况,以及完成与剩余的试?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://kudu.apache.org/2016/06/24/multi-master-1-0-0.html</span></p> <p> </p> <p><span style="font-family:Helvetica;">Google</span><span style="font-family:宋体;">的所有数据^台拥有超q?/span><span style="font-family: Helvetica;">260</span><span style="font-family:宋体;">亿的数据集,每天要添加和删除</span><span style="font-family:Helvetica;">16</span><span style="font-family:宋体;">亿的数据集\径。ؓ了跟t、查询、比较数据集Q他们研发了</span><span style="font-family: Helvetica;">Google Dataset Search</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">GOODS</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">GOODS</span><span style="font-family:宋体;">跟踪?/span><span style="font-family:Helvetica;">API</span><span style="font-family:宋体;">暴露的元数据Q这些元数据被用于检索、监控等?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration: none;text-underline:none">http://dl.acm.org/citation.cfm?id=2903730</span></a></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">其他新闻</span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">SiliconAngle</span><span style="font-family: 宋体;">采访?/span><span style="font-family:Helvetica;">Hortonworks CEO Rob Bearden</span><span style="font-family:宋体;">。主题包括业界趋ѝ?/span><span style="font-family:Helvetica;">Hortonworks</span><span style="font-family:宋体;">财务?/span><span style="font-family:Helvetica;">Hortonworks</span><span style="font-family: 宋体;">的非</span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">技术以及物联网?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://siliconangle.com/blog/2016/06/24/hadoop-and-beyond-a-conversation-with-hortonworks-ceo-rob-bearden/</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">产品发布</span></strong><strong></strong></p> <p align="left"><span style="font-family:Helvetica;">Apache Sentry</span><span style="font-family:宋体;">本周发布?/span><span style="font-family:Helvetica;">1.7.0</span><span style="font-family:宋体;">版,修复?/span><span style="font-family:Helvetica;">bug</span><span style="font-family:宋体;">Q增加了新特性和其他斚w的提升。本ơ发布把</span><span style="font-family:Helvetica;">Hive</span><span style="font-family:宋体;">授权框架升CW二版?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://mail-archives.us.apache.org/mod_mbox/www-announce/201606.mbox/%3CCAPOmu3sDqdzu9ntDSvkMaDRQnVfHrkGV5qhyh-ZRiMmwgMMvBA@mail.gmail.com%3E</span></p> <p align="left"> </p> <p align="left"><span style="font-family:宋体;">Z</span><span style="font-family:Helvetica;">Apache Cassandra 3.0</span><span style="font-family:宋体;">构徏?/span><span style="font-family:Helvetica;">DataStax Enterprise 5.0</span><span style="font-family:宋体;">Q增加了对图数据、分层存储?/span><span style="font-family:Helvetica;">Cassandra</span><span style="font-family:宋体;">多实例的支持。本ơ发布也增加了诸如加密和Z角色讉K控制的附加安全特性支持?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://www.datastax.com/2016/06/introducing-datastax-enterprise-5-0</span></a></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Driven</span><span style="font-family:宋体;">Q大数据应用性能监控pȝ发布?/span><span style="font-family:Helvetica;">2.2</span><span style="font-family:宋体;">版。本ơ发布的亮点是对</span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family:宋体;">的监控提供了支持?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://www.driven.io/2016/06/driven-inc-delivering-hadoop-spark-performance-monitoring-announces-driven-2-2/</span></a></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">BlueData</span><span style="font-family:宋体;">发布了他们ؓ</span><span style="font-family:Helvetica;">Amazon Web Services</span><span style="font-family:宋体;">提供?/span><span style="font-family:Helvetica;">EPIC</span><span style="font-family:宋体;">企业大数据既服务产品。本产品通过单的点击p自动装蝲到基?/span><span style="font-family: Helvetica;">Docker</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">集群?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://www.bluedata.com/blog/2016/06/big-data-as-a-service-on-prem-or-cloud-bdaas/</span></a></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Accumulo</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">1.7.2</span><span style="font-family:宋体;">版。本ơ发布修复了</span><span style="font-family:Helvetica;">write-ahead</span><span style="font-family:宋体;">日志处理方式Q优化了</span><span style="font-family:Helvetica;">RFiles</span><span style="font-family:宋体;">Q以及性能上的提升?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://accumulo.apache.org/release_notes/1.7.2.html</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache ZooKeeper</span><span style="font-family:宋体;">的顶U?/span><span style="font-family:Helvetica;">SDK</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">Apache Curator</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">2.11.0</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">3.2.0</span><span style="font-family:宋体;">版?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://cwiki.apache.org/confluence/display/CURATOR/Releases#Releases-June23,2016,Releases2.11.0and3.2.0available</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Hive</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">2.1.0</span><span style="font-family:宋体;">版。修复了大量</span><span style="font-family:Helvetica;">bug</span><span style="font-family:宋体;">和功能增强,包括?/span><span style="font-family:Helvetica;">Hive</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Live Longer</span><span style="font-family: 宋体;">?/span><span style="font-family:Helvetica;">Prosper </span><span style="font-family:宋体;">改进和以?/span><span style="font-family:Helvetica;">JDBC</span><span style="font-family:宋体;">支持?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://mail-archives.us.apache.org/mod_mbox/www-announce/201606.mbox/%3C7194557D-CB5E-45B7-B905-82F27B7CB33F@apache.org%3E</span></a></p> <p align="left"> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">zd</span></strong><strong></strong></p> <p align="left"><span style="font-size:14.0pt;font-family:SimSun;">中国</span></p> <p align="left"><span style="font-family:Helvetica;">7</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">2</span><span style="font-family:宋体;">?/span> <span style="font-family:宋体;">上v</span><span style="font-family:Helvetica;">BigData Streaming</span><span style="font-family: 宋体;">W三ơ见面会</span></p><img src ="http://m.tkk7.com/rosen/aggbug/431174.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2016-07-12 21:21 <a href="http://m.tkk7.com/rosen/archive/2016/07/12/431174.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>Hadoop周刊—第 175 ?/title><link>http://m.tkk7.com/rosen/archive/2016/07/01/431070.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Fri, 01 Jul 2016 07:44:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2016/07/01/431070.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/431070.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2016/07/01/431070.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/431070.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/431070.html</trackback:ping><description><![CDATA[<p align="left" style="line-height: 10%;"><strong> </strong></p> <p align="left" style="line-height: 10%;"><strong><span style="font-size:16.0pt;line-height:10%">Hadoop</span></strong><strong><span style="font-size:16.0pt;line-height:10%;font-family:宋体;">周刊</span></strong><strong> </strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">W?/span></strong><strong><span style="font-size:16.0pt;line-height:10%"> 175 </span></strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">?/span></strong><strong></strong></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">启明星辰q_和大数据Ml编?/span></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%">2016</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">q?/span><span style="font-size:14.0pt;line-height:10%">6</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span><span style="font-size:14.0pt;line-height:10%">19</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span></p> <p> </p> <p><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">C已过M周了Q我们已看到有多个品(目Q敲定了发布旉。所以在技术新闻部分,有关?/span><span style="font-family:Helvetica;">Hadoop Kerberos</span><span style="font-family:宋体;">认证的内容另外还?/span><span style="font-family:Helvetica;">Salsify</span><span style="font-family:宋体;">应用</span><span style="font-family:Helvetica;">Avro</span><span style="font-family:宋体;">的文章。在产品发布部分Q包?/span><span style="font-family: Helvetica;">Yandex</span><span style="font-family:宋体;">新近开源的列式数据库在内的多个目均有新版本发布?/span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">技术新?/span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">OpenCore</span><span style="font-family:宋体;">博客撰文C了多U?/span><span style="font-family:Helvetica;">Hadoop Kerberos</span><span style="font-family:宋体;">认证协议调试工具。尤其示范了如何使用</span><span style="font-family:Helvetica;">UserGropuInformation</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">“main()”</span><span style="font-family:宋体;">Ҏ导出一些有用的调试信息?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.opencore.com/blog/2016/5/user-name-handling-in-hadoop/</span></p> <p> </p> <p><span style="font-family:Helvetica;">YARN</span><span style="font-family:宋体;">pd文章的第四部分,</span><span style="font-family: Helvetica;">Cloduera</span><span style="font-family:宋体;">博客介绍了如何配|公q度队列。尤其对资源U束讄、队列安|策略和抢占q行了详解?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://blog.cloudera.com/blog/2016/06/untangling-apache-hadoop-yarn-part-4-fair-scheduler-queue-basics/</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">Salsify</span><span style="font-family:宋体;">Z</span><span style="font-family:Helvetica;">Apache Kafka</span><span style="font-family:宋体;">构徏了一个异步微服务架构Qƈ采用</span><span style="font-family:Helvetica;">Apache Avro</span><span style="font-family:宋体;">q行数据序列化。该应用使用</span><span style="font-family:Helvetica;">Ruby</span><span style="font-family:宋体;">开发,他们创徏了多个新工具使得</span><span style="font-family:Helvetica;">Avro</span><span style="font-family:宋体;">能和</span><span style="font-family:Helvetica;">Ruby</span><span style="font-family:宋体;">语言很好的配合。本文介l了q些工具和它们的价|</span><span style="font-family:Helvetica;">avro-builder</span><span style="font-family:宋体;">用于定义记录、基?/span><span style="font-family:Helvetica;">postgres</span><span style="font-family:宋体;">的模式注册表Q?/span><span style="font-family:Helvetica;">avromatic</span><span style="font-family:宋体;">则从</span><span style="font-family:Helvetica;">avro schema</span><span style="font-family:宋体;">生成模型?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blog.salsify.com/engineering/adventures-in-avro</span></p> <p> </p> <p><span style="font-family:Helvetica;">Apache Drill</span><span style="font-family: 宋体;">可以动态推断模式,q支持多模式</span><span style="font-family: Helvetica;">(</span><span style="font-family:宋体;">但相互兼?/span><span style="font-family:Helvetica;">)</span><span style="font-family:宋体;">数据。这U组合得一些有的用例得以实现Q例如跨多个不同模式?/span><span style="font-family: Helvetica;">json</span><span style="font-family:宋体;">文g查询?/span><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">博客探究了这些特性ƈq行了示范?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://www.mapr.com/blog/sql-query-mixed-schema-data-using-apache-drill</span></p> <p> </p> <p><span style="font-family:宋体;">本教E展CZ如何?/span><span style="font-family:Helvetica;">Druid</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Kafka</span><span style="font-family: 宋体;">l合构徏式分析和可视化Q借助</span><span style="font-family: Helvetica;">Pivot</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">Druid</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">web UI</span><span style="font-family:宋体;">Q应用?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.confluent.io/blog/building-a-streaming-analytics-stack-with-apache-kafka-and-druid</span></p> <p> </p> <p><span style="font-family:Helvetica;">Apache Beam</span><span style="font-family: 宋体;">Q孵化中Q博客撰文介l了他们在连?/span><span style="font-family:Helvetica;">Apache Flink</span><span style="font-family:宋体;">批处理集方面的成果?/span><span style="font-family:Helvetica;">Beam</span><span style="font-family:宋体;">是一个开?/span><span style="font-family:Helvetica;">SDK</span><span style="font-family:宋体;">Q最初来自于</span><span style="font-family:Helvetica;">Google</span><span style="font-family:宋体;">Q用于暴露后端未知数据管?/span><span style="font-family:Helvetica;">API</span><span style="font-family:宋体;">?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://beam.incubator.apache.org/blog/2016/06/13/flink-batch-runner-milestone.html</span></p> <p> </p> <p><span style="font-family:Helvetica;">Cask Hydrator</span><span style="font-family: 宋体;">是一个通过</span><span style="font-family:Helvetica;">UI</span><span style="font-family:宋体;">界面采用拖拽方式构徏数据道的工兗本教程也演CZ如何使用</span><span style="font-family:Helvetica;">Hydrator</span><span style="font-family:宋体;">把数据从</span><span style="font-family:Helvetica;">MySQL</span><span style="font-family:宋体;">导入?/span><span style="font-family:Helvetica;">HDFS</span><span style="font-family:宋体;">?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blog.cask.co/2016/06/bringing-relational-data-into-data-lakes/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">撰文介绍了即发布的</span><span style="font-family: Helvetica;">Apache Spark 2.0</span><span style="font-family:宋体;">中新?/span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">子查询功能。有的是,本文以手册Ş式呈玎ͼ最直截了当的展C代码和范例数据?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://databricks.com/blog/2016/06/17/sql-subqueries-in-apache-spark-2-0.html</span></p> <p> </p> <p><span style="font-family:Helvetica;">Apache Kudu</span><span style="font-family: 宋体;">Q孵化中Q博客撰写了在单集群节点使用</span><span style="font-family:Helvetica;">Raft</span><span style="font-family:宋体;">的文章,借此动态扩展到多主节点集群?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://getkudu.io/2016/06/17/raft-consensus-single-node.html</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">其他新闻</span></strong><strong></strong></p> <p><span style="font-family:宋体;">本文指出</span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family:宋体;">C֌如果不用心经营,可能会重走因片化导?/span><span style="font-family:Helvetica;">Apache Hadoop</span><span style="font-family:宋体;">生态系l؜q老\。D例来_最新版本的</span><span style="font-family:Helvetica;">CDH</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">HDP</span><span style="font-family:宋体;">支持不同版本?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://techcrunch.com/2016/06/12/spark-fragmentation-undermines-community/</span></p> <p> </p> <p><span style="font-family:Helvetica;">New Stack</span><span style="font-family:宋体;">撰写了一关?/span><span style="font-family:Helvetica;">Concord</span><span style="font-family:宋体;">的文章,</span><span style="font-family:Helvetica;">Concord</span><span style="font-family:宋体;">是一个构建在</span><span style="font-family:Helvetica;">Apache Mesos</span><span style="font-family: 宋体;">上新的流式处理框Ӟ公开试状态)?/span><span style="font-family:Helvetica;">Concord</span><span style="font-family:宋体;">使用</span><span style="font-family:Helvetica;">C++</span><span style="font-family:宋体;">开发,支持动态拓扑(无需停机实现道的增加和减少Q?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://thenewstack.io/concord-leverages-mesos-high-performance-stream-processing/</span></p> <p> </p> <p><span style="font-family:宋体;">随着</span><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">C֌版的正式发布Q?/span><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">发布了?/span><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">~写</span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family:宋体;">应用E序pd教程的第一?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://databricks.com/blog/2016/06/15/an-introduction-to-writing-apache-spark-applications-on-databricks.html</span></p> <p> </p> <p><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">圣何塞峰会于几周前召开Q期间D行了题ؓ</span><span style="font-family:Helvetica;">“</span><span style="font-family:宋体;">大数据行业中的女?/span><span style="font-family:Helvetica;">”</span><span style="font-family:宋体;">专场午宴?/span><span style="font-family:Helvetica;">Hortonworks</span><span style="font-family: 宋体;">博客Ҏ采访了午宴主持h</span><span style="font-family: Helvetica;">Hortonworks CMO</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">Ingrid Burton</span><span style="font-family:宋体;">?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://hortonworks.com/blog/summer-hortonworks-part-2-wibd-assertive-innovative-take-risks/</span></a></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">产品发布</span></strong><strong></strong></p> <p align="left"><span style="font-family:Helvetica;">Apache SystemML</span><span style="font-family:宋体;">Q孵化中Q最q发布了</span><span style="font-family: Helvetica;">0.10.0</span><span style="font-family:宋体;">版?/span><span style="font-family:Helvetica;">SystemML</span><span style="font-family:宋体;">是一个机器学习框Ӟ由多个项目在背后支撑Q包?/span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Hadoop</span><span style="font-family:宋体;">。本ơ发布包括新?/span><span style="font-family:Helvetica;">Spark Matrix Block</span><span style="font-family:宋体;">cd、支持深度学习、性能上的提升、新?/span><span style="font-family:Helvetica;">KNN</span><span style="font-family:宋体;">法{等?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://systemml.apache.org/0.10.0-incubating/release_notes.html</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Mahout</span><span style="font-family:宋体;">Q另一个机器学习框架发布了</span><span style="font-family: Helvetica;">0.12.2</span><span style="font-family:宋体;">版。本ơ发布向着集成</span><span style="font-family:Helvetica;">Apache Zeppelin</span><span style="font-family:宋体;">可视化和支持</span><span style="font-family:Helvetica;">notebook</span><span style="font-family:宋体;">的目标迈q了一步?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://mail-archives.us.apache.org/mod_mbox/www-announce/201606.mbox/%3CCAOtpBjgBAuQs5FiX5X_5A+Rd-A1fVz0R7SKttGe4cJuCLRiGww@mail.gmail.com%3E</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Qubole</span><span style="font-family:宋体;">宣布他们?/span><span style="font-family:Helvetica;">HBase-as-a-Service</span><span style="font-family:宋体;">已经?/span><span style="font-family:Helvetica;">AWS</span><span style="font-family:宋体;">上提供。它为长时运行集提供了许多漂亮的特性。支?/span><span style="font-family:Helvetica;">Hannibal</span><span style="font-family:宋体;">和其它监控工P集成?/span><span style="font-family:Helvetica;">Apache Zeppelin</span><span style="font-family:宋体;">Qƈ能通过节点引导E序?/span><span style="font-family: Helvetica;">OpenTSDB</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Phoenix</span><span style="font-family:宋体;">配置?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://www.qubole.com/blog/product/quboles-hbase-as-a-service-is-generally-available-on-aws/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Altiscale</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">Altiscale Insight Cloud</span><span style="font-family:宋体;">实时版。本pȝ?/span><span style="font-family:Helvetica;">Apache HBase</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Spark Streaming</span><span style="font-family:宋体;">支撑?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://www.altiscale.com/blog/announcing-the-altiscale-insight-cloud-real-time-edition/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">`hs2client`</span><span style="font-family:宋体;">是一个ؓ</span><span style="font-family:Helvetica;">Apache Hive</span><span style="font-family: 宋体;">?/span><span style="font-family:Helvetica;">Apache Impala</span><span style="font-family:宋体;">Q孵化中Q提供的?/span><span style="font-family:Helvetica;">C++</span><span style="font-family:宋体;">库。除了支?/span><span style="font-family:Helvetica;">C++</span><span style="font-family:宋体;">Q这个库q绑定了</span><span style="font-family:Helvetica;">python</span><span style="font-family:宋体;">Q可以在</span><span style="font-family:Helvetica;">pandas</span><span style="font-family:宋体;">中把数据d</span><span style="font-family:Helvetica;">DataFrame</span><span style="font-family:宋体;">?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://blog.cloudera.com/blog/2016/06/announcing-hs2client-a-fast-new-c-python-thrift-client-for-impala-and-hive/</span></a></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">在其发行版中支持?/span><span style="font-family:Helvetica;">Apache Spark 2.0</span><span style="font-family: 宋体;">开发者预览版?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://www.mapr.com/blog/spark-20-now-developer-preview-mode-mapr-platform</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Beam</span><span style="font-family:宋体;">发布了其</span><span style="font-family:Helvetica;">0.1.0</span><span style="font-family:宋体;">孵化版,是本目加入</span><span style="font-family: Helvetica;">Apache</span><span style="font-family:宋体;">孵化器以来首ơ发布?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://beam.incubator.apache.org/beam/release/2016/06/15/first-release.html</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Yandex</span><span style="font-family:宋体;">开源了</span><span style="font-family:Helvetica;">ClickHouse</span><span style="font-family:宋体;">Q一个列式分析数据库。本pȝ为横向和U向扩展而生。支持复杂数据类型(例如数组Q和q似查询。该团队q发布了与其它数据库相比的基准测试结果?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://clickhouse.yandex/</span></p> <p align="left"> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">zd</span></strong><strong></strong></p> <p align="left"><span style="font-size:14.0pt;font-family:SimSun;">中国</span></p> <p align="left"> </p><img src ="http://m.tkk7.com/rosen/aggbug/431070.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2016-07-01 15:44 <a href="http://m.tkk7.com/rosen/archive/2016/07/01/431070.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>Hadoop周刊—第 174 ?/title><link>http://m.tkk7.com/rosen/archive/2016/06/28/431032.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Tue, 28 Jun 2016 09:39:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2016/06/28/431032.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/431032.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2016/06/28/431032.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/431032.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/431032.html</trackback:ping><description><![CDATA[<p align="left" style="line-height: 10%;"><strong> </strong></p> <p align="left" style="line-height: 10%;"><strong><span style="font-size:16.0pt;line-height:10%">Hadoop</span></strong><strong><span style="font-size:16.0pt;line-height:10%;font-family:宋体;">周刊</span></strong><strong> </strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">W?/span></strong><strong><span style="font-size:16.0pt;line-height:10%"> 174 </span></strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">?/span></strong><strong></strong></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">启明星辰q_和大数据Ml编?/span></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%">2016</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">q?/span><span style="font-size:14.0pt;line-height:10%">6</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span><span style="font-size:14.0pt;line-height:10%">12</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span></p> <p> </p> <p><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">C本周在旧金山召开Q正如所料,本期周刊有大量关?/span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family:宋体;">的新闅R公告和版本发布。除</span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">外,本期q有</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Cask</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Ambari</span><span style="font-family:宋体;">斚w的文章。在产品发布部分Q有一q来</span><span style="font-family:Helvetica;">Apache Pig</span><span style="font-family:宋体;">首次版本更新Q还一个ؓ分布式系l设计的z新工具</span><span style="font-family:Helvetica;">Runway</span><span style="font-family:宋体;">Q最后是新版</span><span style="font-family:Helvetica;">Apache Kudu</span><span style="font-family:宋体;">Q孵化中Q?/span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">技术新?/span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">Debezium</span><span style="font-family:宋体;">是一个相对较新的目Q用于数据库?/span><span style="font-family:Helvetica;">Apache Kafka topic</span><span style="font-family:宋体;">行改变数据捕获。当面支?/span><span style="font-family:Helvetica;">MySQL</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Zookeeper</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">Q这是一在</span><span style="font-family:Helvetica;">Docker</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kubernetes</span><span style="font-family:宋体;">容器上配|?/span><span style="font-family:Helvetica;">Zookeeper, Kafka, MySQL</span><span style="font-family:宋体;">的教E?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://debezium.io/blog/2016/05/31/Debezium-on-Kubernetes/</span></a></p> <p> </p> <p><span style="font-family:宋体;">有些人对</span><span style="font-family:Helvetica;">Apache Kafka</span><span style="font-family:宋体;">目宣布采用另一U流式处理引擎感到惊Ӟq就?/span><span style="font-family:Helvetica;">Kafka Streams</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kafka Streams</span><span style="font-family: 宋体;">与其它系l存在显著的关键差异。本文很好的C了这些不同点</span><span style="font-family:Helvetica;">——abstraction</span><span style="font-family:宋体;">、部|模型、支持基于状态的计算?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://softwaremill.com/kafka-streams-how-does-it-fit-stream-landscape/</span></p> <p> </p> <p><span style="font-family:宋体;">每个使用</span><span style="font-family:Helvetica;">MapReduce</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">或类似系l的人都会陷入难以调试、数据特?/span><span style="font-family:Helvetica;">bug</span><span style="font-family:宋体;">q些问题中?/span><span style="font-family:Helvetica;">BigDebug</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">UCLA</span><span style="font-family:宋体;">Q加州大学洛杉矶分校Q的研究目</span><span style="font-family: Helvetica;">/</span><span style="font-family:宋体;">论文Q旨在让开发h员通过工具发现单机问题Q传入参数导致的崩溃Q跟t、断炏V观察点、gq报警等。该工具支持</span><span style="font-family:Helvetica;">Apache Spark 1.2.1</span><span style="font-family:宋体;">上?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://blog.acolyer.org/2016/06/07/bigdebug-debugging-primitives-for-interactive-big-data-processing-in-spark/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Cask</span><span style="font-family:宋体;">撰文介绍了在开?/span><span style="font-family:Helvetica;">Cask Data Application Platform (CDAP)</span><span style="font-family:宋体;">中运?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">的文章。运行在</span><span style="font-family:Helvetica;">CDAP</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">E序通过讉K</span><span style="font-family:Helvetica;">Apache Tephra</span><span style="font-family:宋体;">Q孵化中Q实现细_度事务支持。这Pp很容易利用快照隔dC一个表复制到另一个表的一致性?/span><span style="font-family:Helvetica;">CDAP</span><span style="font-family:宋体;">中的</span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">也能讉K</span><span style="font-family:Helvetica;">Cask Tracker</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">Cask Tracker</span><span style="font-family:宋体;">提供数据血~信息(什么时候创建、用等Q。根据应用的不同Q?/span><span style="font-family:Helvetica;">CDAP</span><span style="font-family:宋体;">工具q能发挥更大价倹{?/span><strong></strong></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blog.cask.co/2016/06/cdap-spark-prototype-to-production/</span></p> <p> </p> <p><span style="font-family:Helvetica;">IBM Hadoop Dev</span><span style="font-family: 宋体;">博客撰写了从</span><span style="font-family:Helvetica;">cURL</span><span style="font-family:宋体;">调用</span><span style="font-family:Helvetica;">Ambari REST API</span><span style="font-family:宋体;">的教E。还C了在</span><span style="font-family:Helvetica;">vanilla</span><span style="font-family:宋体;">和启用了</span><span style="font-family:Helvetica;">kerberos</span><span style="font-family:宋体;">的集上建立会话Qƈ为接下来的请求复用会话?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://developer.ibm.com/hadoop/2016/06/07/ambari-rest-calls-for-kerberos-enabled-clusters/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Google</span><span style="font-family:宋体;">云^台博客撰文介l了如何调试q行?/span><span style="font-family:Helvetica;">Google Dataflow</span><span style="font-family:宋体;">上的</span><span style="font-family:Helvetica;">Apache Beam</span><span style="font-family: 宋体;">Q孵化中QQ务。ؓ了调试性能瓉Q?/span><span style="font-family:Helvetica;">Dataflow</span><span style="font-family:宋体;">有一些有用的l计数据?/span><span style="font-family:Helvetica;">UI</span><span style="font-family:宋体;">来帮助用者深入每一个步骤?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://cloud.google.com/blog/big-data/2016/06/understanding-timing-in-cloud-dataflow-pipelines</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">其他新闻</span></strong><strong></strong></p> <p align="left"><span style="font-family:Helvetica;">Transaction Processing Performance Council(TPC)</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">TPCx-BB</span><span style="font-family:宋体;">基准试Q该基准试为大数据pȝ设计。除了衡?/span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">外,q可以对机器学习集群和分c问题进行测试?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://www.datanami.com/2016/06/01/big-data-benchmark-gauges-hadoop-platforms/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:宋体;">伦敦</span><span style="font-family:Helvetica;">Strata + Hadoop</span><span style="font-family:宋体;">世界大会两周前已召开。演讲者的专题报告和灯片已发布到会议|站上?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://conferences.oreilly.com/strata/hadoop-big-data-eu/public/schedule/proceedings</span></a></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Splice Machine</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">上的</span><span style="font-family:Helvetica;">RDBMS</span><span style="font-family:宋体;">构徏者,宣布开源他们的软g。当前,他们正在L贡献?/span><span style="font-family:Helvetica;">/</span><span style="font-family:宋体;">导师</span><span style="font-family:Helvetica;">/</span><span style="font-family:宋体;">豪杰来提升开源后的效果?/span><span style="font-family:Helvetica;">Splice Machine</span><span style="font-family:宋体;">有不有的Ҏ,例如</span><span style="font-family:Helvetica;">ACID</span><span style="font-family:宋体;">事务Q二U烦引,引用完整性?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://www.splicemachine.com/were_going_open_source/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Altiscale</span><span style="font-family:宋体;">博客~辑了许多关于客h务、情感分析、气候变化、智慧城市?/span><span style="font-family: Helvetica;">bias</span><span style="font-family:宋体;">{方面的大数据应用案例文章。还攉了一些大数据怀疑论者的文章?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://www.altiscale.com/blog/big-data-news-health-and-public-safety-sentiment-analysis-fixing-education-2/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">C本周在旧金山召开。会议组l?/span><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">概述了两天内的热点内容,链接了许多的演讲和专题报告?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://databricks.com/blog/2016/06/08/another-record-setting-spark-summit.html</span></p> <p align="left"> </p> <p align="left"><span style="font-family:"MS Mincho";MS Mincho";">大数据即?/span><span style="font-family:SimSun;">?/span><span style="font-family:"MS Mincho";MS Mincho";">QBDaaSQ公?/span><span style="font-family:Helvetica;">Qubole</span><span style="font-family:宋体;">Q撰文介l了他们的客户如何接受?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">。接受速度之快</span><span style="font-family:Helvetica;">——</span><span style="font-family:宋体;">一半多的客L在开始用</span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Qubole</span><span style="font-family:宋体;">也支?/span><span style="font-family:Helvetica;">Presto</span><span style="font-family:宋体;">Q他们也看到了类似的增长?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://www.qubole.com/blog/big-data/spark-usage/</span></p> <p align="left"> </p> <p><span style="font-family:Helvetica;">Twitter</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache</span><span style="font-family:宋体;">孵化器提交了他们的复制日志服?/span><span style="font-family:Helvetica;">DistributedLog</span><span style="font-family:宋体;">?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://wiki.apache.org/incubator/DistributedLogProposal</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">Big Data Day LA</span><span style="font-family: 宋体;">?/span><span style="font-family:Helvetica;">6</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">9</span><span style="font-family:宋体;">日在</span><span style="font-family:宋体;color:#2E2E2E;">西洛杉矶学院召开。这ơ活动是免费的(如果预先注册的话Q,演讲者来自于</span><span style="font-family:Helvetica;">Confluent</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Yahoo</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Netflix</span><span style="font-family:宋体;">{?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.bigdatadayla.com/</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">产品发布</span></strong><strong></strong></p> <p align="left"><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">Spark 2.0</span><span style="font-family:宋体;">预览版。发布声明中说道</span><span style="font-family:Helvetica;">API</span><span style="font-family:宋体;">和功能都未最l敲定?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://spark.apache.org/news/spark-2.0.0-preview.html</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">JustOne</span><span style="font-family:宋体;">构徏q开源了</span><span style="font-family:Helvetica;">Kafka-to-PostgreSQL</span><span style="font-family:宋体;">q接器。本文介l了该连接器的性能Q详l描qC如何把消息{换ؓ行,q描qC如何讑֮配置{?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://www.confluent.io/blog/kafka-connect-sink-for-postgresql-from-justone-database</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Salesforce</span><span style="font-family:宋体;">开源了</span><span style="font-family:Helvetica;">Runway</span><span style="font-family:宋体;">Q这是一个徏模、仿真以及可视化分布式系l。在</span><span style="font-family:Helvetica;">runway.system</span><span style="font-family:宋体;">上有一个在U演C环境,演示?/span><span style="font-family:Helvetica;">“too many bananas”</span><span style="font-family:宋体;">模型Q电梯系l和</span><span style="font-family:Helvetica;">Raft</span><span style="font-family:宋体;">一致性系l?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://medium.com/salesforce-open-source/runway-intro-dc0d9578e248</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Bloomberg</span><span style="font-family:宋体;">最q开源了</span><span style="font-family:Helvetica;">Presto Accumulo</span><span style="font-family: 宋体;">Q面?/span><span style="font-family:Helvetica;">Apache Accumulo</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Presto</span><span style="font-family:宋体;">q接器。在声明中,链接?/span><span style="font-family:Helvetica;">11</span><span style="font-family:宋体;">늚论文Q比较了Z?/span><span style="font-family:Helvetica;">Presto</span><span style="font-family:宋体;">查询和基?/span><span style="font-family:Helvetica;">Accumulo Java API</span><span style="font-family:宋体;">查询的基准测试结果?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://www.bloomberg.com/company/announcements/open-source-at-bloomberg-reducing-application-development-time-via-presto-accumulo/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:"MS Mincho";MS Mincho";">?/span><span style="font-family:SimSun;">?/span><span style="font-family:Helvetica;">Azure</span><span style="font-family:宋体;">发布了基?/span><span style="font-family:Helvetica;">Apache Spark 1.6.1 </span><span style="font-family:宋体;">E_版的</span><span style="font-family:Helvetica;">Azure HDInsight</span><span style="font-family:宋体;">。本ơ发布支持了面向</span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Project Livy REST</span><span style="font-family: 宋体;">d服务支持Q集成了</span><span style="font-family: Helvetica;">Azure</span><span style="font-family:宋体;">数据湖存储(Z角色的访问控ӞQ集成了</span><span style="font-family:Helvetica;">IntelliJ</span><span style="font-family:宋体;">Q支持了</span><span style="font-family:Helvetica;">Jupyter</span><span style="font-family:宋体;">W记本等?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://azure.microsoft.com/en-us/blog/apache-spark-for-azure-hdinsight-now-generally-available/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">LinkedIn</span><span style="font-family:宋体;">开源了</span><span style="font-family:Helvetica;">Photon ML</span><span style="font-family:宋体;">Q他们的大规模回归分析库?/span><span style="font-family: Helvetica;">Photon</span><span style="font-family:宋体;">构徏?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">之上q在</span><span style="font-family:Helvetica;">LinkedIn</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">YARN</span><span style="font-family:宋体;">上运行(q去Z</span><span style="font-family:Helvetica;">MapReduce</span><span style="font-family:宋体;">Q似乎因提升性能才迁U)?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://engineering.linkedin.com/blog/2016/06/open-sourcing-photon-ml</span></a></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Hortonworks</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">Spark-HBase</span><span style="font-family: 宋体;">q接器的技术预览版。预览版原生支持</span><span style="font-family:Helvetica;">Avro</span><span style="font-family:宋体;">Q支持运行安全集,原生支持</span><span style="font-family:Helvetica;">Spark Datasource API</span><span style="font-family:宋体;">Qƈ优化了分Z剪,列修剪,谓词下推?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://hortonworks.com/blog/spark-hbase-dataframe-based-hbase-connector/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family: 宋体;">q_的第一阶段安全Ҏ。本阶段寚w?/span><span style="font-family:Helvetica;">ACL</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">SAML 2.0</span><span style="font-family:宋体;">q行了支持,端对端的审计日志?/span></p> <p align="left"><span style="font-family:Helvetica;color:#386EFF;">https://databricks.com/blog/2016/06/08/achieving-end-to-end-security-for-apache-spark-with-databricks.html</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache ORC 1.1.0</span><span style="font-family:宋体;">版发布了。本ơ发布完成了从基?/span><span style="font-family: Helvetica;">Apache Hive</span><span style="font-family:宋体;">的代码到Z</span><span style="font-family:Helvetica;">Java</span><span style="font-family:宋体;">的代码迁U,修正?/span><span style="font-family:Helvetica;">C++</span><span style="font-family:宋体;">旉戛_理程序,增加?/span><span style="font-family: Helvetica;">Hadoop MapReduce</span><span style="font-family:宋体;">q接器?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://orc.apache.org/news/2016/06/10/ORC-1.1.0/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Kudu</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">0.9.0</span><span style="font-family:宋体;">版。增加了</span><span style="font-family:Helvetica;">UPSERT</span><span style="font-family:宋体;">命oQ新?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">数据源不会依?/span><span style="font-family:Helvetica;">MapReduce API</span><span style="font-family: 宋体;">Q提升了</span><span style="font-family:Helvetica;">Tablet Server</span><span style="font-family:宋体;">写性能?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://getkudu.io/2016/06/10/apache-kudu-0-9-0-released.html</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Google</span><span style="font-family:宋体;">云服务^台团队发布了支持</span><span style="font-family:Helvetica;">Spark 2.0</span><span style="font-family:宋体;">预览版的</span><span style="font-family:Helvetica;">Google Cloud Dataproc</span><span style="font-family: 宋体;">?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://cloud.google.com/blog/big-data/2016/06/google-cloud-dataproc-the-fast-easy-and-safe-way-to-try-spark-20-preview</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Dory</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">Bruce</span><span style="font-family:宋体;">的承者)</span><span style="font-family:Helvetica;">Kafka producer</span><span style="font-family:宋体;">的守护进E,现在支持?/span><span style="font-family:Helvetica;">UNIX domain sockets</span><span style="font-family:宋体;">或本?/span><span style="font-family:Helvetica;">TCP</span><span style="font-family:"MS Mincho";MS Mincho";">接收数据了?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://mail-archives.apache.org/mod_mbox/kafka-users/201606.mbox/%3C1465683894.608424023@apps.rackspace.com%3E</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Pig 0.16.0</span><span style="font-family:宋体;">版,一q来首次发布。坚定了?/span><span style="font-family: Helvetica;">Tez</span><span style="font-family:宋体;">的支持?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://pig.apache.org/releases.html#8+June%2C+2016%3A+release+0.16.0+available</span></p> <p align="left"> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">zd</span></strong><strong></strong></p> <p align="left"><span style="font-size:14.0pt;font-family:SimSun;">中国</span></p> <p align="left"><span style="font-family:Helvetica;">Spark Meetup (</span><span style="font-family:宋体;">上v</span><span style="font-family:Helvetica;">) – </span><span style="font-family:宋体;">周六</span><span style="font-family:Helvetica;">, 6</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">18</span><span style="font-family:宋体;">?/span></p><img src ="http://m.tkk7.com/rosen/aggbug/431032.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2016-06-28 17:39 <a href="http://m.tkk7.com/rosen/archive/2016/06/28/431032.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>Hadoop周刊—第 173 ?/title><link>http://m.tkk7.com/rosen/archive/2016/06/20/430972.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Mon, 20 Jun 2016 01:47:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2016/06/20/430972.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/430972.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2016/06/20/430972.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/430972.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/430972.html</trackback:ping><description><![CDATA[<p align="left" style="line-height: 10%;"><strong> </strong></p> <p align="left" style="line-height: 10%;"><strong><span style="font-size:16.0pt;line-height:10%">Hadoop</span></strong><strong><span style="font-size:16.0pt;line-height:10%;font-family:宋体;">周刊</span></strong><strong> </strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">W?/span></strong><strong><span style="font-size:16.0pt;line-height:10%"> 173 </span></strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">?/span></strong><strong></strong></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">启明星辰q_和大数据Ml编?/span></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%">2016</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">q?/span><span style="font-size:14.0pt;line-height:10%">6</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span><span style="font-size:14.0pt;line-height:10%">5</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span></p> <p> </p> <p><span style="font-family:宋体;">本周Q?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">NiFi</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Netflix Meson</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Storm</span><span style="font-family:宋体;">斚w只有量内容?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">C本周在旧金山召开Q所以呢Q下周肯定有不少内容?/span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">技术新?/span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">博客介绍?/span><span style="font-family:Helvetica;">Apache Spark 2.0</span><span style="font-family:宋体;">的新Ҏ?/span><span style="font-family:Helvetica;">——</span><span style="font-family:宋体;">跨语a支持存储和加载机器学习模型。模型通过单的</span><span style="font-family:Helvetica;">API</span><span style="font-family:宋体;">被存储和加蝲Q模型的元数据与参数保存?/span><span style="font-family:Helvetica;">JSON</span><span style="font-family:宋体;">风格Q模型的数据保存?/span><span style="font-family:Helvetica;">Parquet</span><span style="font-family:宋体;">风格?/span></p> <p><span style="font-family:Helvetica;">https://databricks.com/blog/2016/05/31/apache-spark-2-0-preview-machine-learning-model-persistence.html</span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://databricks.com/blog/2016/05/31/apache-spark-2-0-preview-machine-learning-model-persistence.html</span></p> <p> </p> <p><span style="font-family:Helvetica;">Meson</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Netflix</span><span style="font-family:宋体;">用于执行机器学习工作的框架。它?/span><span style="font-family:Helvetica;">Apache Hive</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Mesos</span><span style="font-family:宋体;">q些大数据技术之间的_合剂。工作流使用</span><span style="font-family:Helvetica;">DSL</span><span style="font-family:宋体;">q行~写Q?/span><span style="font-family:Helvetica;">Meson</span><span style="font-family:宋体;">q提供了更加先进的流水线可视?/span><span style="font-family: Helvetica;">UI</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Netflix</span><span style="font-family:宋体;">目前没开?/span><span style="font-family:Helvetica;">Meson</span><span style="font-family:宋体;">Q但他们有这斚w的计划?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://techblog.netflix.com/2016/05/meson_31.html</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">IBM Hadoop Dev</span><span style="font-family: 宋体;">博客要介l和C?/span><span style="font-family: Helvetica;">HDFS</span><span style="font-family:宋体;">归存储能力?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://developer.ibm.com/hadoop/2016/06/01/use-hdfs-archival-storage/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Apache Storm 1.0</span><span style="font-family: 宋体;">有了令h惊讶的新Ҏ。本文关注了几个调试能力斚w的增强:动态日志别、统一日志搜烦?/span><span style="font-family:Cambria;">事g抽样、集?/span><span style="font-family: Helvetica;">jstack/heap dumps/java</span><span style="font-family:宋体;">飞行记录器分?/span><span style="font-family:Helvetica;">worker</span><span style="font-family:宋体;">?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://hortonworks.com/blog/whats-new-apache-storm-1-0-part-1-enhanced-debugging/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Cloudera</span><span style="font-family:宋体;">博客撰文介绍了如何?/span><span style="font-family: Helvetica;">Apache Spark</span><span style="font-family:宋体;">来探索性分析存储在</span><span style="font-family:Helvetica;">CSV</span><span style="font-family:宋体;">文g中的</span><span style="font-family:Helvetica;">NBA</span><span style="font-family:宋体;">历史l计数据。分析过E؜合用了</span><span style="font-family: Helvetica;">Scala</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blog.cloudera.com/blog/2016/06/how-to-analyze-fantasy-sports-using-apache-spark-and-sql/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Apache NiFi</span><span style="font-family: 宋体;">作ؓ一U通用工具受到了很多的x。它?/span><span style="font-family:Helvetica;">“</span><span style="font-family:宋体;">Z程的处?/span><span style="font-family:Helvetica;">”</span><span style="font-family:宋体;">而生Q可能对很多人ƈ不意味着什么,?/span><span style="font-family:Helvetica;">NiFi</span><span style="font-family:宋体;">支持标准?/span><span style="font-family:Helvetica;">ETL</span><span style="font-family:宋体;">Q流式处理等。许?/span><span style="font-family:Helvetica;">NiFi</span><span style="font-family:宋体;">例子都示范了如何?/span><span style="font-family:Helvetica;">Twitter firehose</span><span style="font-family:宋体;">把数据移动到</span><span style="font-family:Helvetica;">HDFS</span><span style="font-family:宋体;">中,但本文聚焦在</span><span style="font-family:Helvetica;">NiFi</span><span style="font-family:宋体;">另外的特性上</span><span style="font-family:Helvetica;">——</span><span style="font-family:宋体;">C了一些简单的?/span><span style="font-family:Helvetica;">HTTP</span><span style="font-family:宋体;">拉数据的q程?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://hortonworks.com/blog/apache-nifi-not-scratch/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Amazon Redshift</span><span style="font-family: 宋体;">构徏?/span><span style="font-family:Helvetica;">PostgreSQL</span><span style="font-family:宋体;">引擎上,所以你可以利用</span><span style="font-family:Helvetica;">PostgreSQL</span><span style="font-family:宋体;">的扩展功能让</span><span style="font-family:Helvetica;">Redshift</span><span style="font-family:宋体;">集群q接</span><span style="font-family:Helvetica;">PostgresSQL</span><span style="font-family:宋体;">实例。这样一来,诸如跨数据库q接、将</span><span style="font-family:Helvetica;">Redshift</span><span style="font-family:宋体;">的结果{换ؓ</span><span style="font-family:Helvetica;">JSON</span><span style="font-family:宋体;">、在</span><span style="font-family:Helvetica;">Postgres</span><span style="font-family:宋体;">中创?/span><span style="font-family:Helvetica;">Redshift</span><span style="font-family:宋体;">数据视图?/span></p> <p><span style="font-family:宋体;">数据库之间复制数据等有趣的应用都能实现?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blogs.aws.amazon.com/bigdata/post/Tx1GQ6WLEWVJ1OX/JOIN-Amazon-Redshift-AND-Amazon-RDS-PostgreSQL-WITH-dblink</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">其他发布</span></strong><strong></strong></p> <p align="left"><span style="font-family:Helvetica;">FeatherCast</span><span style="font-family:宋体;">发布了超q?/span><span style="font-family:Helvetica;">100</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">ApacheCon</span><span style="font-family:宋体;">北美C的相兛_韟?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://feathercast.apache.org/tag/apacheconna2016/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">InfoWorld</span><span style="font-family:宋体;">介绍?/span><span style="font-family:Helvetica;">Heron</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">Twitter</span><span style="font-family:宋体;">才开源的</span><span style="font-family:Helvetica;">Apache Storm</span><span style="font-family:宋体;">兼容目。本文介l了两个目在架构上的不同。主要指Z</span><span style="font-family:Helvetica;">Heron</span><span style="font-family:宋体;">h于几个月前(</span><span style="font-family:Helvetica;">Storm</span><span style="font-family:宋体;">已发布)Q就是说</span><span style="font-family:Helvetica;">Storm</span><span style="font-family:宋体;">在特性上?/span><span style="font-family:Helvetica;">Heron</span><span style="font-family:宋体;">更有优势?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://www.infoworld.com/article/3078134/analytics/had-it-with-apache-storm-heron-swoops-to-the-rescue.html</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">edX</span><span style="font-family:宋体;">上开了一门新评Q?/span><span style="font-family:Helvetica;">“Apache Spark</span><span style="font-family:宋体;">入门</span><span style="font-family:Helvetica;">”</span><span style="font-family:宋体;">。课E从</span><span style="font-family:Helvetica;">6</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">15</span><span style="font-family:宋体;">日开始,一直持l两周?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">launch-first-of-five-free-big-data-courses-on-apache-spark.html</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">产品发布</span></strong><strong></strong></p> <p align="left"><span style="font-family:Helvetica;">Amazon EMR</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">4.7.0</span><span style="font-family:宋体;">版。本ơ发布支持了</span><span style="font-family:Helvetica;">Apache Tez</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Phoenix</span><span style="font-family:宋体;">Qƈ内置了新版本?/span><span style="font-family:Helvetica;">Apache HBase</span><span style="font-family: 宋体;">?/span><span style="font-family:Helvetica;">Apache Mahout</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Presto</span><span style="font-family:宋体;">。另外,</span><span style="font-family:Helvetica;">AWS</span><span style="font-family:宋体;">大数据博客还指导?/span><span style="font-family:Helvetica;">Phoenix</span><span style="font-family:宋体;">如何上手?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://aws.amazon.com/blogs/aws/amazon-emr-4-7-0-apache-tez-phoenix-updates-to-existing-apps/</span></a></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://blogs.aws.amazon.com/bigdata/post/Tx2ZF1NDQYDJFGT/Supercharge-SQL-on-Your-Data-in-Apache-HBase-with-Apache-Phoenix</span></a></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Hive</span><span style="font-family:宋体;">本周发布?/span><span style="font-family:Helvetica;">2.0.1</span><span style="font-family:宋体;">版。从二月发布</span><span style="font-family:Helvetica;">2.0.0</span><span style="font-family:宋体;">以来Q首ơ小版本发布。本ơ修复了</span><span style="font-family:Helvetica;">60</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">bug</span><span style="font-family:宋体;">?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://mail-archives.us.apache.org/mod_mbox/www-announce/201605.mbox/%3CD37344A3.77A64%25sershe@apache.org%3E</span></p> <p align="left"> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">zd</span></strong><strong></strong></p> <p align="left"><span style="font-size:14.0pt;font-family:SimSun;">中国</span></p> <p align="left"><span style="font-family:宋体;">?/span></p><img src ="http://m.tkk7.com/rosen/aggbug/430972.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2016-06-20 09:47 <a href="http://m.tkk7.com/rosen/archive/2016/06/20/430972.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>Hadoop周刊—第 172 ?/title><link>http://m.tkk7.com/rosen/archive/2016/06/09/430841.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Wed, 08 Jun 2016 16:11:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2016/06/09/430841.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/430841.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2016/06/09/430841.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/430841.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/430841.html</trackback:ping><description><![CDATA[<p align="left" style="line-height: 10%;"><strong> </strong></p> <p align="left" style="line-height: 10%;"><strong><span style="font-size:16.0pt;line-height:10%">Hadoop</span></strong><strong><span style="font-size:16.0pt;line-height:10%;font-family:宋体;">周刊</span></strong><strong> </strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">W?/span></strong><strong><span style="font-size:16.0pt;line-height:10%"> 172 </span></strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">?/span></strong><strong></strong></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">启明星辰q_和大数据Ml编?/span></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%">2016</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">q?/span><span style="font-size:14.0pt;line-height:10%">5</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span><span style="font-size:14.0pt;line-height:10%">22</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span></p> <p> </p> <p><span style="font-family:宋体;">本周主要x式计算</span><span style="font-family:Helvetica;">—— Twitter</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Cloudera</span><span style="font-family:宋体;">介绍了他们新的流式计框Ӟ有文章介l了</span><span style="font-family:Helvetica;">Apache Flink</span><span style="font-family:宋体;">的流?/span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">DataTorrent</span><span style="font-family: 宋体;">介绍?/span><span style="font-family:Helvetica;">Apache Apex</span><span style="font-family:宋体;">定w机制Q还?/span><span style="font-family:Helvetica;">Concord</span><span style="font-family:宋体;">q样新的式计算框架Q另外还?/span><span style="font-family:Helvetica;">Apache Kafka</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">0.10</span><span style="font-family:宋体;">版。其他新L面,</span><span style="font-family:Helvetica;">Apache</span><span style="font-family:宋体;">孵化器有新动?/span><span style="font-family:Helvetica;">——Apache TinkerPop</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Zeppelin</span><span style="font-family:宋体;">孵化成ؓ目Q?/span><span style="font-family:Helvetica;">Tephra</span><span style="font-family:宋体;">q入孵化器。除了上q内容,</span><span style="font-family: Helvetica;">Apache Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache HBase</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Drill</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Ambari</span><span style="font-family:宋体;">{也有新文章?/span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">技术新?/span></strong><strong></strong></p> <p align="left"><span style="font-family:Helvetica;">DataTorrent</span><span style="font-family:宋体;">博客撰文介绍?/span><span style="font-family:Helvetica;">Apache Apex</span><span style="font-family:宋体;">在读写数据文件时的容错机制?/span><span style="font-family:Helvetica;">Apex</span><span style="font-family:宋体;">是专门处理流式数据的Q流式计有一些微妙但重要的细节需要考虑。例如?/span><span style="font-family:Helvetica;">HDFS</span><span style="font-family:宋体;">输出Ӟ</span><span style="font-family:Helvetica;">HDFS</span><span style="font-family:宋体;">的租U机制会引发问题?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://www.datatorrent.com/blog/fault-tolerant-file-processing/</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">博客介绍?/span><span style="font-family:Helvetica;">Spark 2.0</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Tungsten</span><span style="font-family:宋体;">代码生成引擎带来的性能提升。博文D例说明了׃虚拟函数的管理,更好地利?/span><span style="font-family:Helvetica;">CPU</span><span style="font-family:宋体;">寄存器和循环展开Q所以代码生成引擎能更快的生成代码。除?/span><span style="font-family: Helvetica;">Databricks</span><span style="font-family:宋体;">的博文外Q?/span><span style="font-family:Helvetica;">Morning Paper</span><span style="font-family:宋体;">q谈C上技术其实是受到</span><span style="font-family: Helvetica;">VLDB</span><span style="font-family:宋体;">论文的启发?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://databricks.com/blog/2016/05/23/apache-spark-as-a-compiler-joining-a-billion-rows-per-second-on-a-laptop.html</span></a></p> <p><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://blog.acolyer.org/2016/05/23/efficiently-compiling-efficient-query-plans-for-modern-hardware/</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">StreamScope</span><span style="font-family: 宋体;">是微软流式处理系l,?/span><span style="font-family: Helvetica;">Morning Paper</span><span style="font-family:宋体;">本周撰写的另一个流式计文章。介l了该系l的特征</span><span style="font-family:Helvetica;">——</span><span style="font-family:宋体;">吞吐?/span><span style="font-family:Helvetica;">/</span><span style="font-family:宋体;">集群大小、编E模?/span><span style="font-family:Helvetica;">(SQL)</span><span style="font-family:宋体;">、时间模型、语义学</span><span style="font-family:Helvetica;">/</span><span style="font-family:宋体;">保证Q以及微软品中的应用?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://blog.acolyer.org/2016/05/24/streamscope-continuous-reliable-distributed-processing-of-big-data-streams/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Apache</span><span style="font-family:宋体;">博客撰文介绍?/span><span style="font-family:Helvetica;">HubSpot</span><span style="font-family:宋体;">团队?/span><span style="font-family:Helvetica;">Apache HBase</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">G1GC</span><span style="font-family:宋体;">调优斚w的经验。本文回?/span><span style="font-family:Helvetica;">HubSpot</span><span style="font-family:宋体;">如何试和保障稳定性、如何保?/span><span style="font-family:Helvetica;">99%</span><span style="font-family:宋体;">的性能、如何羃短花在垃圑֛收上的时间。该团队使用很多技巧,很好地决l了错综复杂?/span><span style="font-family:Helvetica;">GC</span><span style="font-family:宋体;">法。本文最后,q一步步C?/span><span style="font-family:Helvetica;">HBase</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">G1GC</span><span style="font-family:宋体;">调优?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://blogs.apache.org/hbase/entry/tuning_g1gc_for_your_hbase</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">LinkedIn</span><span style="font-family:宋体;">撰文阐述了调?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">偏移量管理问题的诸多困难。本文聚焦了两个所?/span><span style="font-family:Helvetica;">"offset rewind"</span><span style="font-family: Cambria;">事g的症Ӟ如何在监控过E中到q类事gQ以及导致这两个事g的根本原因(及解x案)?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://engineering.linkedin.com/blog/2016/05/kafkaesque-days-at-linkedin--part-1</span></p> <p> </p> <p><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">博客发布了?/span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family:宋体;">q行基因变异分析pd文章的第三部分也是最后一。本文从准备Q把文g转换?/span><span style="font-family:Helvetica;">Parquet</span><span style="font-family:宋体;">q加载进</span><span style="font-family:Helvetica;">Spark RRD</span><span style="font-family:宋体;">Q到如何加蝲基因型数据再到运?/span><span style="font-family: Helvetica;">kmeans</span><span style="font-family:宋体;">聚类法Z基因型特征预地理种?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://databricks.com/blog/2016/05/24/predicting-geographic-population-using-genome-variants-and-k-means.html</span></p> <p> </p> <p><span style="font-family:宋体;">许多批处理大数据生态系l已从自定义</span><span style="font-family:Helvetica;">API</span><span style="font-family:宋体;">回到</span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">上,所以如果流式处理框架也发生了同L变化Q一定很有趣。本文,</span><span style="font-family:Helvetica;">Apache Flink</span><span style="font-family:宋体;">团队介绍他们计划支持式</span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Flink</span><span style="font-family:宋体;">已经有了</span><span style="font-family:Helvetica;">Table API</span><span style="font-family:宋体;">Q他们利?/span><span style="font-family:Helvetica;">Apache Calcite</span><span style="font-family:宋体;">提供了对</span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">的支持。对?/span><span style="font-family:Helvetica;">windowing</span><span style="font-family:宋体;">Q他们计划用</span><span style="font-family:Helvetica;">Calcite</span><span style="font-family:宋体;">的流?/span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">扩展。最初对</span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">的支持将?/span><span style="font-family:Helvetica;">1.1.0</span><span style="font-family:宋体;">版中体现Q在</span><span style="font-family:Helvetica;">1.2.0</span><span style="font-family:宋体;">版加强?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://flink.apache.org/news/2016/05/24/stream-sql.html</span></p> <p> </p> <p><span style="font-family:宋体;">本文介绍?/span><span style="font-family:Helvetica;">Apache Drill</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">XML</span><span style="font-family:宋体;">插g。尽还没有?/span><span style="font-family:Helvetica;">Drill</span><span style="font-family:宋体;">集成在一P但它相当Ҏ被编译成</span><span style="font-family:Helvetica;">jar</span><span style="font-family:宋体;">和配|对</span><span style="font-family:Helvetica;">XML</span><span style="font-family:宋体;">的支持?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://www.mapr.com/blog/how-use-xml-plugin-apache-drill</span></p> <p> </p> <p><span style="font-family:Helvetica;">Hortonworks</span><span style="font-family: 宋体;">博客略介l了</span><span style="font-family:Helvetica;">Ambari</span><span style="font-family:宋体;">监控度量pȝ的架构,最q加入了</span><span style="font-family:Helvetica;">Grafana</span><span style="font-family:宋体;">作ؓ其前端A表盘。该pȝ使用</span><span style="font-family:Helvetica;">Apache Phoenix</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache HBase</span><span style="font-family:宋体;">作ؓ存储支撑Q所以是可以横向扩展的?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://hortonworks.com/blog/hood-ambari-metrics-grafana/</span></p> <p> </p> <p><span style="font-family:宋体;">q篇教程介绍了怎样?/span><span style="font-family:Helvetica;">Amazon EMR</span><span style="font-family:宋体;">上?/span><span style="font-family:Helvetica;">Spark SQL</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hue</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Zeppelin</span><span style="font-family:宋体;">配合q行</span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">查询存储?/span><span style="font-family:Helvetica;">S3</span><span style="font-family:宋体;">中跨制表W分割的数据。本文最后展CZ如何?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">DynamoDB</span><span style="font-family:宋体;">存储数据?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blogs.aws.amazon.com/bigdata/post/Tx2D93GZRHU3TES/Using-Spark-SQL-for-ETL</span></p> <p> </p> <p><span style="font-family:Helvetica;">Heroku</span><span style="font-family:宋体;">团队分n了他们用最新版</span><span style="font-family: Helvetica;">Apache Kafka</span><span style="font-family:宋体;">的体?/span><span style="font-family:Helvetica;">——</span><span style="font-family:宋体;">才引入的</span><span style="font-family:Helvetica;">timestamp</span><span style="font-family:宋体;">字段Q?/span><span style="font-family:Helvetica;">8</span><span style="font-family:宋体;">字节Q会D一些反直觉的性能变化?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://engineering.heroku.com/blogs/2016-05-27-apache-kafka-010-evaluating-performance-in-distributed-systems/</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">其他新闻</span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">O'Reilly</span><span style="font-family:宋体;">数据播客U?/span><span style="font-family:Helvetica;">Spark 2.0</span><span style="font-family:宋体;">中结构化式计算斚w的问题采访了来自</span><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Michael Armbrust</span><span style="font-family: 宋体;">。网站上的一文章选择引用了其中的话题</span><span style="font-family:Helvetica;">—— Spark SQL</span><span style="font-family:宋体;">、结构化式计算的目标、端到端道的保证、对在线处理q用</span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">机器学习法?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://www.oreilly.com/ideas/structured-streaming-comes-to-apache-spark-2-0</span></p> <p> </p> <p><span style="font-family:宋体;">本周两个大数据项目从</span><span style="font-family:Helvetica;">Apache</span><span style="font-family:宋体;">孵化器孵化完?/span><span style="font-family:Helvetica;">——Apache TinkerPop</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Zeppelin</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">TinkerPop</span><span style="font-family:宋体;">是图计算框架Q?/span><span style="font-family:Helvetica;">Zeppelin</span><span style="font-family:宋体;">是面向数据分析基?/span><span style="font-family:Helvetica;">web</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">notebook</span><span style="font-family:宋体;">?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://blogs.apache.org/foundation/entry/the_apache_software_foundation_announces91</span></a></p> <p><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://blogs.apache.org/foundation/entry/the_apache_software_foundation_announces92</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">Tephra</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">HBase</span><span style="font-family:宋体;">的事务引擎进入了</span><span style="font-family:Helvetica;">Apache</span><span style="font-family:宋体;">孵化器?/span><span style="font-family:Helvetica;">Tephra</span><span style="font-family:宋体;">最初由</span><span style="font-family:Helvetica;">Cask</span><span style="font-family:宋体;">的团队创建,目前仅和</span><span style="font-family:Helvetica;">Apache Phoenix</span><span style="font-family:宋体;">q行了集成?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blog.cask.co/2016/05/tephra-a-transaction-engine-for-hbase-moves-to-apache-incubation/</span></p> <p> </p> <p><span style="font-family:Helvetica;">TechRepublic</span><span style="font-family: 宋体;">撰文介绍?/span><span style="font-family:Helvetica;">Concord.io</span><span style="font-family:宋体;">Q一个由</span><span style="font-family:Helvetica;">C++</span><span style="font-family:宋体;">开发的式处理框架。旨在填补高性能式计算市场的空~?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.techrepublic.com/article/could-concord-topple-apache-spark-from-its-big-data-throne/</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">产品发布</span></strong><strong></strong></p> <p align="left"><span style="font-family:Helvetica;">Apache Avro</span><span style="font-family:宋体;">本周发布?/span><span style="font-family:Helvetica;">1.8.1</span><span style="font-family:宋体;">版。修复了过</span><span style="font-family:Helvetica;">20</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">bug</span><span style="font-family:宋体;">和一些其它进步?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://mail-archives.us.apache.org/mod_mbox/www-announce/201605.mbox/%3CCAO4re1nYMm79WQ2LUeODWjHmJ9EiYOF=mty6p2aiq-S_4R95iQ@mail.gmail.com%3E</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Confluent</span><span style="font-family:宋体;">发布了基?/span><span style="font-family:Helvetica;">librdkafka</span><span style="font-family:宋体;">开发的</span><span style="font-family:Helvetica;">Kafka Python</span><span style="font-family:宋体;">客户端?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://pypi.python.org/pypi/confluent-kafka/0.9.1.1</span></p> <p align="left"> </p> <p align="left"><span style="font-family:"MS Mincho";MS Mincho";">伴随着新的</span><span style="font-family:Helvetica;">Kafka </span><span style="font-family:宋体;">式计算方式Q?/span><span style="font-family:Helvetica;">Apache Kafka 0.10</span><span style="font-family:宋体;">版发布了。新版本支持了机架感知和消息中的</span><span style="font-family:Helvetica;">timestamp</span><span style="font-family:宋体;">Q提升了</span><span style="font-family:Helvetica;">SASL</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kafka Connect</span><span style="font-family:宋体;">{?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://mail-archives.us.apache.org/mod_mbox/www-announce/201605.mbox/%3CCAPuboUuRyCRxDp5CLjv2yVM77SpYFF+HdnBeiiyeumYTJNpY4g@mail.gmail.com%3E</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Confluent</span><span style="font-family:宋体;">发布了基?/span><span style="font-family:Helvetica;">Apache Kafka 0.10</span><span style="font-family: 宋体;">?/span><span style="font-family:Helvetica;">Confluent Platform 3.0</span><span style="font-family:宋体;">版。除?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">的核心特性,</span><span style="font-family:Helvetica;">Confluent Platform</span><span style="font-family:宋体;">q有一个商业组件ؓ</span><span style="font-family:Helvetica;">Kafka Connect</span><span style="font-family:宋体;">提供配置工具和端到端监控?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://www.confluent.io/blog/announcing-apache-kafka-0.10-and-confluent-platform-3.0</span></a></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Kylin</span><span style="font-family:宋体;">Q大数据</span><span style="font-family:Helvetica;">OLAP</span><span style="font-family:宋体;">引擎Q发布了</span><span style="font-family:Helvetica;">1.5.2</span><span style="font-family:宋体;">版。作Zơ补丁的发布,</span><span style="font-family:Helvetica;">1.5.2</span><span style="font-family:宋体;">有不新Ҏ?/span><span style="font-family:Helvetica;">/</span><span style="font-family:宋体;">提升</span><span style="font-family:Helvetica;">/bug</span><span style="font-family:宋体;">修复Q包括支?/span><span style="font-family:Helvetica;">CDH 5.7</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://mail-archives.us.apache.org/mod_mbox/www-announce/201605.mbox/%3CCA+LQBaTDxb4wVYVvtOC22gMbJ0p9cvhAWzEY_x2n1oNGvEDPSQ@mail.gmail.com%3E</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Twitter</span><span style="font-family:宋体;">开源了他们的流式处理系l?/span><span style="font-family:Helvetica;">Heron</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Heron</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Twitter</span><span style="font-family:宋体;">用于替换</span><span style="font-family:Helvetica;">Apache Storm</span><span style="font-family: 宋体;">的品,发力点在性能、调试以及开发h员生产率?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://blog.twitter.com/2016/open-sourcing-twitter-heron</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Envelope</span><span style="font-family:宋体;">是来自于</span><span style="font-family:Helvetica;">Cloudera Labs</span><span style="font-family: 宋体;">的新目Q它提供了基于配|文件的式</span><span style="font-family:Helvetica;">ETL</span><span style="font-family:宋体;">处理q程。构建在</span><span style="font-family:Helvetica;">Spark streaming</span><span style="font-family:宋体;">之上Q?/span><span style="font-family:Helvetica;">Envelope</span><span style="font-family:宋体;">最q正在研发面?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kudu</span><span style="font-family:宋体;">的连接器?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://blog.cloudera.com/blog/2016/05/new-in-cloudera-labs-envelope-for-apache-spark-streaming/</span></p> <p align="left"> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">zd</span></strong><strong></strong></p> <p align="left"><span style="font-size:14.0pt;font-family:SimSun;">中国</span></p> <p align="left"><span style="font-family:Helvetica;">Spark Meetup 4 (</span><span style="font-family:宋体;">杭州</span><span style="font-family:Helvetica;">) – </span><span style="font-family:宋体;">周日</span><span style="font-family:Helvetica;">, 6</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">5</span><span style="font-family:宋体;">?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://www.meetup.com/Hangzhou-Apache-Spark-Meetup/events/231071384/</span></p><img src ="http://m.tkk7.com/rosen/aggbug/430841.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2016-06-09 00:11 <a href="http://m.tkk7.com/rosen/archive/2016/06/09/430841.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>Hadoop周刊—第 171 ?/title><link>http://m.tkk7.com/rosen/archive/2016/06/08/430838.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Wed, 08 Jun 2016 08:42:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2016/06/08/430838.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/430838.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2016/06/08/430838.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/430838.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/430838.html</trackback:ping><description><![CDATA[<p align="left" style="line-height: 10%;"><strong> </strong></p> <p align="left" style="line-height: 10%;"><strong><span style="font-size:16.0pt;line-height:10%">Hadoop</span></strong><strong><span style="font-size:16.0pt;line-height:10%;font-family:宋体;">周刊</span></strong><strong> </strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">W?/span></strong><strong><span style="font-size:16.0pt;line-height:10%"> 171 </span></strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">?/span></strong><strong></strong></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">启明星辰q_和大数据Ml编?/span></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%">2016</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">q?/span><span style="font-size:14.0pt;line-height:10%">5</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span><span style="font-size:14.0pt;line-height:10%">22</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span></p> <p> </p> <p><span style="font-family:宋体;">本周Q包?/span><span style="font-family:Helvetica;">LinkedIn</span><span style="font-family:宋体;">新开源项目在内的几个目都有版本发布。在技术新d其他新闻斚wQ多文章回了</span><span style="font-family:Helvetica;">Apache: Big Data North America</span><span style="font-family:宋体;">会议Q另外有一l跨多个不同数据系l分析纽U出UR数据的系列文章?/span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">技术新?/span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">博客分析?/span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family:宋体;">中两UD法。之一Q?/span><span style="font-family:Helvetica;">“approxCountDistict”</span><span style="font-family:宋体;">是用来评C同值的数量Q之二,</span><span style="font-family: Helvetica;">“approxQuantile”</span><span style="font-family:宋体;">用于生成D癑ֈ比。本文介l了法和可视化_ֺ不同的残差?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://databricks.com/blog/2016/05/19/approximate-algorithms-in-apache-spark-hyperloglog-and-quantiles.html</span></p> <p> </p> <p><span style="font-family:宋体;">本教E描qC如何使用</span><span style="font-family:Helvetica;">Apache Hadoop HDFS</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Solr</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hue</span><span style="font-family:宋体;">存储、烦引、查?/span><span style="font-family:Helvetica;">DICOM</span><span style="font-family:宋体;">格式的医学媄像。文章诏I了加蝲和获取数据的整个步骤?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blog.cloudera.com/blog/2016/05/how-to-process-and-index-medical-images-with-apache-hadoop-and-apache-solr/</span></p> <p> </p> <p><span style="font-family:Helvetica;">MapR Streams</span><span style="font-family: 宋体;">是一?/span><span style="font-family:Helvetica;">API</span><span style="font-family:宋体;">兼容</span><span style="font-family:Helvetica;">Apache Kafka</span><span style="font-family:宋体;">的系l。本文在宏观上比较了</span><span style="font-family:Helvetica;">MapR Streams</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">的异同。同旉明了</span><span style="font-family:Helvetica;">Kafka Streams</span><span style="font-family:宋体;">怎样?/span><span style="font-family:Helvetica;">MapR Streams</span><span style="font-family:宋体;">扯上关系的?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://www.mapr.com/blog/apache-kafka-and-mapr-streams-terms-techniques-and-new-designs</span></p> <p> </p> <p><span style="font-family:宋体;">本文在我看来是最清晰介绍</span><span style="font-family:Helvetica;">Paxos</span><span style="font-family:宋体;">的文章之一Q?/span><span style="font-family:Helvetica;">Paxos</span><span style="font-family:宋体;">为分布式pȝ构徏了一致性协议。本文用l图计算机和分布式拍卖示范了q个协议?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://ifeanyi.co/posts/understanding-consensus/</span></p> <p> </p> <p><span style="font-family:宋体;">Z</span><span style="font-family:Helvetica;">Apache: Big Data North America</span><span style="font-family:宋体;">会议上的一演讌Ӏ?/span><span style="font-family:Helvetica;">Datanami</span><span style="font-family:宋体;">H探了即发布的</span><span style="font-family:Helvetica;">Apache Hadoop 3</span><span style="font-family: 宋体;">的新Ҏ。包括,</span><span style="font-family:Helvetica;">shell</span><span style="font-family:宋体;">脚本重写、Q务集本地优化、内存大自动׾~能力、支?/span><span style="font-family:Helvetica;">HDFS erasure codings</span><span style="font-family:宋体;">。本文着重在</span><span style="font-family:Helvetica;">erasure codings</span><span style="font-family:宋体;">上,文章密切x?/span><span style="font-family:Helvetica;">erasure codings</span><span style="font-family:宋体;">在存储效率方面的提升Q?/span><span style="font-family: Helvetica;">3x</span><span style="font-family:宋体;">盘消耗降低到</span><span style="font-family:Helvetica;">1.5x</span><span style="font-family:宋体;">Q?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://www.datanami.com/2016/05/18/hadoop-3-poised-boost-storage-capacity-resilience-erasure-coding/</span></a></p> <p> </p> <p><span style="font-family:宋体;">q篇演讲来自?/span><span style="font-family:Helvetica;">PyData</span><span style="font-family:宋体;">柏林会议Q描qC</span><span style="font-family:Helvetica;">Apache Arrow</span><span style="font-family: 宋体;">?/span><span style="font-family:Helvetica;">Feather</span><span style="font-family:宋体;">文g格式Q探I了数据在跨语言</span><span style="font-family:Helvetica;">/</span><span style="font-family:宋体;">框架互操作性的工作机制?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://www.slideshare.net/wesm/python-data-ecosystem-thoughts-on-building-for-the-future</span></a></p> <p> </p> <p><span style="font-family:宋体;">发布了两个来自于不同会议?/span><span style="font-family:Helvetica;">Apache Kafka</span><span style="font-family:宋体;">有关的演讲视频。第一个讨Z</span><span style="font-family: Helvetica;">Kafka</span><span style="font-family:宋体;">的安全特性,W二个探索了</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">如何跨系l共享数据?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://www.oreilly.com/learning/securing-apache-kafka</span></a></p> <p><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://www.infoq.com/presentations/event-streams-kafka</span></a></p> <p> </p> <p><span style="font-family:宋体;">q篇博客集成了数利?/span><span style="font-family:Helvetica;">Amazon Redshift</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Google BigQuery</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Postgres</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Presto</span><span style="font-family:宋体;">数据pȝ加蝲</span><span style="font-family:Helvetica;">/</span><span style="font-family:宋体;">查询U约出租车数据的文章。除了原始基准测试,q详l介l了如何处理故障、优化、比较替代方案(</span><span style="font-family:Helvetica;">AWS</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">S3</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">HDFS</span><span style="font-family:宋体;">比)?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://tech.marksblogg.com/all-billion-nyc-taxi-rides-redshift.html</span></p> <p> </p> <p><span style="font-family:Helvetica;">O'Reilly</span><span style="font-family:宋体;">撰文介绍了通过</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Flink</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Elasticsearch</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kibana</span><span style="font-family:宋体;">怎样实现</span><span style="font-family:Helvetica;">kappa</span><span style="font-family:宋体;">架构。文章概qC</span><span style="font-family:Helvetica;">lambda</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">kappa</span><span style="font-family:宋体;">架构Q介l了主要的架构组Ӟ以及怎样讄使用贝叶斯模型发现新奇事物?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://www.oreilly.com/ideas/applying-the-kappa-architecture-in-the-telco-industry</span></a></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">其他新闻</span></strong><strong></strong></p> <p><span style="font-family:宋体;">本文列D了最q在</span><span style="font-family:Helvetica;">Apache: Big Data North America</span><span style="font-family:宋体;">会议上提到的几个大数据生态系l项目。有不少是我们没U_视线的内宏V?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.datanami.com/2016/05/11/open-source-tour-de-force-apache-big-data-2016/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Pivotal</span><span style="font-family:宋体;">博客有一关于大数据和敏捷开发有的文章。大数据pȝ往往停留在非敏捷的世界,例如在装载数据前需求要攉CQ模型要定义好。本文认为,没有在云环境中经q长期验证的话,要对q种方式q行U束Q有限的能力和性能、竖井式数据{)?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://blog.pivotal.io/big-data-pivotal/features/when-it-comes-to-big-data-cloud-and-agility-go-hand-in-hand</span></p> <p> </p> <p><span style="font-family:Helvetica;">Databricks</span><span style="font-family:宋体;">发布了他们记录的|络会议视频</span><span style="font-family: Helvetica;">“Apache Spark MLlib: From Quick Start to Scikit-Learn”</span><span style="font-family:宋体;">。除了视频内容,他们q在会议中解{了八个常见问题?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://databricks.com/blog/2016/05/18/spark-mllib-from-quick-start-to-scikit-learn.html</span></p> <p> </p> <p><span style="font-family:Helvetica;">Hortonworks</span><span style="font-family: 宋体;">博客回顾?/span><span style="font-family:Helvetica;">Apache Storm</span><span style="font-family:宋体;">的历双Ӏ?/span><span style="font-family:Helvetica;">2011</span><span style="font-family:宋体;">q开源,</span><span style="font-family:Helvetica;">2013</span><span style="font-family:宋体;">q进?/span><span style="font-family:Helvetica;">Apache</span><span style="font-family:宋体;">孵化器,</span><span style="font-family:Helvetica;">2014</span><span style="font-family:宋体;">q成为顶U项目,今年初发布了</span><span style="font-family:Helvetica;">1.0</span><span style="font-family:宋体;">版。本文论qC每个里程的主要技术进步?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://hortonworks.com/blog/brief-history-apache-storm/</span></p> <p> </p> <p align="left"><span style="font-family:Helvetica;">HBaseCon</span><span style="font-family:宋体;">本周在旧金山召开。这ơ会议,</span><span style="font-family:Helvetica;">Apple</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Yahoo</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Facebook</span><span style="font-family:宋体;">都有演讲材料?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline: none">http://hbasecon.com</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">发图庆祝了过Mq中</span><span style="font-family: Helvetica;">Apache Drill</span><span style="font-family:宋体;">取得的成l。一q中发布?/span><span style="font-family:Helvetica;">7</span><span style="font-family:宋体;">个版本,完成了多个里E碑?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://www.mapr.com/blog/happy-anniversary-apache-drill-what-difference-year-makes</span></p> <p> </p> <p><span style="font-family:Helvetica;">Datanami</span><span style="font-family:宋体;">发布了在</span><span style="font-family:Helvetica;">Apache: Big Data North America</span><span style="font-family:宋体;">会议上,</span><span style="font-family:Helvetica;">ASF</span><span style="font-family:宋体;">ȝ</span><span style="font-family:Helvetica;">Jim Jagielski</span><span style="font-family: 宋体;">?/span><span style="font-family:Helvetica;">ODPi</span><span style="font-family:宋体;">目ȝ</span><span style="font-family:Helvetica;">John Mertic</span><span style="font-family:宋体;">的问{录Q如大家所料,主要话题q是</span><span style="font-family:Helvetica;">ASF</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">ODPi</span><span style="font-family:宋体;">的关pR?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.datanami.com/2016/05/20/apache-foundation-keeps-eyes-wide-open-odpi/</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">产品发布</span></strong><strong></strong></p> <p align="left"><span style="font-family:Helvetica;">LinkedIn</span><span style="font-family:宋体;">开源了</span><span style="font-family:Helvetica;">Ambry</span><span style="font-family:宋体;">Q他们的</span><span style="font-family:Helvetica;">ObjectStore</span><span style="font-family: 宋体;">分布式系l?/span><span style="font-family:Helvetica;">Ambry</span><span style="font-family:宋体;">代码已提交到</span><span style="font-family:Helvetica;">github</span><span style="font-family:宋体;">Q这博文介l了</span><span style="font-family:Helvetica;">Ambry</span><span style="font-family:宋体;">的服务承诺,设计目标Q体pL构和接口?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://engineering.linkedin.com/blog/2016/05/introducing-and-open-sourcing-ambry---linkedins-new-distributed-</span></p> <p align="left"> </p> <p align="left"><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">apache HAWQ</span><span style="font-family:宋体;">Q孵化中Q驱动的</span><span style="font-family:Helvetica;">Pivotal HDB </span><span style="font-family:宋体;">本周发布?/span><span style="font-family:Helvetica;">2.0</span><span style="font-family:宋体;">版,</span><span style="font-family:Helvetica;">HDB</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">提供了分析数据库?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">https://blog.pivotal.io/big-data-pivotal/products/fail-fast-and-ask-more-questions-of-your-data-with-hdb-2-0</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Mahout</span><span style="font-family:宋体;">本周发布?/span><span style="font-family:Helvetica;">0.12.1</span><span style="font-family:"MS Mincho";MS Mincho";">版,</span><span style="font-family:Helvetica;">Mahout</span><span style="font-family:宋体;">是一个机器学习和数据挖掘pȝ。本ơ发布旨在推q?/span><span style="font-family:Helvetica;">Flink</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Mahout</span><span style="font-family:宋体;">的集成?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://mail-archives.us.apache.org/mod_mbox/www-announce/201605.mbox/%3CCAOtpBjhshagyLN3Qnt0xRnc7YbnMVJjTS4piVXL7LiS2pQguXw@mail.gmail.com%3E</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Tajo</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">0.11.3</span><span style="font-family:宋体;">版?/span><span style="font-family:Helvetica;">Tajo</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">的数据仓库。本ơ发布修正了</span><span style="font-family:Helvetica;">5</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">bug</span><span style="font-family:宋体;">?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://tajo.apache.org/releases/0.11.3/announcement.html</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">MongoDB</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family: 宋体;">发布了新?/span><span style="font-family:Helvetica;">MongoDB Connector</span><span style="font-family:宋体;">。除了对?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hadoop InputFormat shim</span><span style="font-family:宋体;">外,?/span><span style="font-family:Helvetica;">Connector</span><span style="font-family:宋体;">q有其他Ҏ。最后,q解释了</span><span style="font-family: Helvetica;">MongoDB</span><span style="font-family:宋体;">一些关键特性?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://www.mongodb.com/blog/post/mongodb-connector-for-apache-spark-announcing-early-access-program-and-new-spark-training</span></a></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://rosslawley.co.uk/introducing-a-new=mongodb-spark-connector/</span></a></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">SyncSort</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">DMX-h v9</span><span style="font-family:宋体;">Q支?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">以及新的执行框架?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://insidebigdata.com/2016/05/20/syncsorts-latest-innovations-simplify-integration-of-streaming-data-in-spark-kafka-and-hadoop-for-real-time-analytics/</span></p> <p align="left"> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">zd</span></strong><strong></strong></p> <p align="left"><span style="font-size:14.0pt;font-family:SimSun;">中国</span></p> <p align="left"><span style="font-family:SimSun;">?/span></p><img src ="http://m.tkk7.com/rosen/aggbug/430838.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2016-06-08 16:42 <a href="http://m.tkk7.com/rosen/archive/2016/06/08/430838.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>Hadoop周刊—第 169 ?/title><link>http://m.tkk7.com/rosen/archive/2016/05/15/430513.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Sun, 15 May 2016 12:30:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2016/05/15/430513.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/430513.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2016/05/15/430513.html#Feedback</comments><slash:comments>1</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/430513.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/430513.html</trackback:ping><description><![CDATA[<p class="MsoNormal" align="left" style="text-align:left;line-height:10%; mso-outline-level:1"><strong style="mso-bidi-font-weight:normal"><span lang="EN-US" style="font-size:16.0pt;line-height:10%"><o:p> </o:p></span></strong></p><p align="left" style="line-height: 10%;"><br /></p><p align="left" style="line-height: 10%;"><strong><span style="font-size:16.0pt;line-height:10%">Hadoop</span></strong><strong><span style="font-size:16.0pt;line-height:10%;font-family:宋体;">周刊</span></strong><strong> </strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">W?/span></strong><strong><span style="font-size:16.0pt;line-height:10%"> 169 </span></strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">?/span></strong><strong></strong></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">启明星辰q_和大数据整体l编?/span></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%">2016</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">q?/span><span style="font-size:14.0pt;line-height:10%">5</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span><span style="font-size:14.0pt;line-height:10%">8</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span></p> <p> </p> <p><span style="font-family:宋体;">本周内容短小_。主题覆?/span><span style="font-family:Helvetica;">Apache Beam</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">季度业W、最q的</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">CQ以及来?/span><span style="font-family:Helvetica;">Cloudera</span><span style="font-family:宋体;">新开源的分布式单元测试框架?/span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">技术新?/span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">Elastic</span><span style="font-family:宋体;">分析了宕Z件的Ҏ。错误配|?/span><span style="font-family: Helvetica;">ZooKeeper</span><span style="font-family:宋体;">内存讄会引赯度的</span><span style="font-family:Helvetica;">GC</span><span style="font-family:宋体;">Q这从Ҏ上导?/span><span style="font-family:Helvetica;">ZooKeeper</span><span style="font-family:宋体;">集群丢失。文章介l了一些缓解策略,用来防止未来cM问题的发生?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://www.elastic.co/blog/elastic-cloud-outage-april-2016</span></p> <p> </p> <p><span style="font-family:Helvetica;">Cask</span><span style="font-family:宋体;">博客明扼要的归纳了最q?/span><span style="font-family: Helvetica;">Big Data Applications Meetup</span><span style="font-family:宋体;">的花i。首先出场的?/span><span style="font-family:Helvetica;">Pachyderm</span><span style="font-family:宋体;">Q它Z</span><span style="font-family:Helvetica;">Docker</span><span style="font-family:宋体;">容器提供</span><span style="font-family:Helvetica;">“</span><span style="font-family:宋体;">数据</span><span style="font-family:Helvetica;">Git”</span><span style="font-family:宋体;">语义。第二个出场的是</span><span style="font-family: Helvetica;">TubeMogul</span><span style="font-family:宋体;">大数据^収ͼ</span><span style="font-family:Helvetica;">TubeMogul</span><span style="font-family:宋体;">构徏?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hive</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Presto</span><span style="font-family:宋体;">之上?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blog.cask.co/2016/05/pachyderm-and-tubemogul-share-their-big-data-application-platforms-and-experience/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Google</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">dataArtisans</span><span style="font-family:宋体;">同时撰文介绍?/span><span style="font-family:Helvetica;">Apache Beam</span><span style="font-family:宋体;">Q前生是</span><span style="font-family:Helvetica;">Google Dataflow SDK</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">Google</span><span style="font-family:宋体;">的文章解释了Z开源和开?/span><span style="font-family:Helvetica;">Beam</span><span style="font-family:宋体;">的动机,</span><span style="font-family:Helvetica;">dataArtisans</span><span style="font-family: 宋体;">的文章介l他们对</span><span style="font-family:Helvetica;">Beam</span><span style="font-family:宋体;">模型的支持以及怎样考虑</span><span style="font-family:Helvetica;">Flink</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Beam API</span><span style="font-family:宋体;">之间的关pR?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://cloud.google.com/blog/big-data/2016/05/why-apache-beam-a-google-perspective</span></a></p> <p><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration: none;text-underline:none">http://data-artisans.com/why-apache-beam/</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">IBM Hadoop dev</span><span style="font-family: 宋体;">博客有个关于安装</span><span style="font-family:Helvetica;">Python</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Scala</span><span style="font-family:宋体;">和ؓ</span><span style="font-family:Helvetica;">Jupyter notebook</span><span style="font-family:宋体;">嵌入</span><span style="font-family:Helvetica;">R</span><span style="font-family:宋体;">内核的操作说明。同Ӟ也说明了怎样q接</span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">和通过</span><span style="font-family:Helvetica;">SSL</span><span style="font-family:宋体;">暴露</span><span style="font-family:Helvetica;">notebook</span><span style="font-family:宋体;">?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://developer.ibm.com/hadoop/blog/2016/05/04/install-jupyter-notebook-spark/</span></p> <p> </p> <p><span style="font-family:宋体;">本文介绍?/span><span style="font-family:Helvetica;">Mongo Hadoop</span><span style="font-family:宋体;">的连接函数是如何Hv</span><span style="font-family: Helvetica;">Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">MongoDB</span><span style="font-family:宋体;">的?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://x.ai/using-the-mongo-hadoop-connector-as-a-translation-layer-to-spark/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Qubole</span><span style="font-family:宋体;">博客撰文比较了用于大数据分析的流行编E语a</span><span style="font-family:Helvetica;">—Python</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">R</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Scala</span><span style="font-family:宋体;">?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.qubole.com/blog/big-data/programming-language/</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">其他新闻</span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">宣布本季度他们授权下单创U录的增长了</span><span style="font-family:Helvetica;">99%</span><span style="font-family:宋体;">Q以?/span><span style="font-family:Helvetica;">146%</span><span style="font-family:宋体;">的美元净增长率?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://www.mapr.com/company/press-releases/mapr-achieves-another-record-quarter-99-software-subscription-license-growth</span></p> <p> </p> <p><span style="font-family:宋体;">本文描述了最q?/span><span style="font-family:Helvetica;">Google Cloud Dataflow</span><span style="font-family: 宋体;">?/span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Google Compute Engine</span><span style="font-family:宋体;">上的基准试表现?/span><span style="font-family:Helvetica;">Dataflow</span><span style="font-family:宋体;">胜过</span><span style="font-family:Helvetica;">Spark2</span><span style="font-family:宋体;">Q?/span><span style="font-family:Helvetica;">5.7</span><span style="font-family:宋体;">倍(一直以来,最好是在自q环境下评估工作负载,而不是一味的信Q基准试Q。本文还解释了一U?/span><span style="font-family:Helvetica;">“</span><span style="font-family:宋体;">h</span><span style="font-family:Helvetica;">”</span><span style="font-family:宋体;">Q通过它每个使用大数据工L益?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.datanami.com/2016/05/02/dataflow-tops-spark-benchmark-test/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Confluent</span><span style="font-family:宋体;">博客回顾了最q召开?/span><span style="font-family: Helvetica;">Kafka</span><span style="font-family:宋体;">CQ包括编E挑战预选赛Q主题演Ԍ分组会议{等?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.confluent.io/blog/log-compaction-kafka-summit-edition-may-2016</span></p> <p> </p> <p><span style="font-family:宋体;">布斯介l了国q通在q去</span><span style="font-family:Helvetica;">5</span><span style="font-family:宋体;">q间采用大数据技术的历程。本文中Q美国运通分享了一些技巧和学到的经验教训,例如采用新技术的困难Q得到组l高层的认同是多么的重要Q,以及雇䄦和留住工E师的挑战等{?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://www.forbes.com/sites/ciocentral/2016/04/27/inside-american-express-big-data-journey/</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">产品发布</span></strong><strong></strong></p> <p align="left"><span style="font-family:Helvetica;">Cask</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">Cask Data Application Platform (CDAP)3.4</span><span style="font-family:宋体;">版本?/span><span style="font-family:"MS Mincho";MS Mincho";">新版本增加了</span><span style="font-family:Helvetica;">Cask Tracker</span><span style="font-family: 宋体;">Q新的数据集?/span><span style="font-family:Helvetica;">/</span><span style="font-family:宋体;">审计</span><span style="font-family:Helvetica;">/</span><span style="font-family:宋体;">搜烦pȝQ升U了</span><span style="font-family:Helvetica;">Cask Hydrator</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">UI</span><span style="font-family:宋体;">Q增Z?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">的支持等{?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://blog.cask.co/2016/05/announcing-cdap-release-3-4-introducing-tracker-next-gen-hydrator-enhanced-spark-support-and-much-more/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Cloudera</span><span style="font-family:宋体;">开源了</span><span style="font-family:Helvetica;">“dist_tes”</span><span style="font-family:宋体;">Qƈ行执行单元测试的新工兗通过该工P?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kudu</span><span style="font-family:宋体;">目q行单元试Q可以在数分钟而不是数时内完成。该工具l定?/span><span style="font-family: Helvetica;">C++</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Java</span><span style="font-family:宋体;">Qƈ在网站上演示了这些特性?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://blog.cloudera.com/blog/2016/05/quality-assurance-at-cloudera-distributed-unit-testing/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Google</span><span style="font-family:宋体;">宣布</span><span style="font-family:Helvetica;">Google BigQuery</span><span style="font-family: 宋体;">?/span><span style="font-family:Helvetica;">Drive</span><span style="font-family:宋体;">可集成在一P把输Z存到</span><span style="font-family:Helvetica;">Google sheets</span><span style="font-family:宋体;">?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://techcrunch.com/2016/05/06/google-connects-bigquery-to-google-drive-and-sheets/</span></p> <p align="left"> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">zd</span></strong><strong></strong></p> <p align="left"><span style="font-size:14.0pt;font-family:SimSun;">中国</span></p> <p align="left"><span style="font-family:SimSun;">?/span></p><img src ="http://m.tkk7.com/rosen/aggbug/430513.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2016-05-15 20:30 <a href="http://m.tkk7.com/rosen/archive/2016/05/15/430513.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>Hadoop周刊—第 168 ?/title><link>http://m.tkk7.com/rosen/archive/2016/05/07/430401.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Sat, 07 May 2016 15:37:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2016/05/07/430401.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/430401.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2016/05/07/430401.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/430401.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/430401.html</trackback:ping><description><![CDATA[<p align="left" style="line-height: 10%;"><strong> </strong></p> <p align="left" style="line-height: 10%;"><strong><span style="font-size:16.0pt;line-height:10%">Hadoop</span></strong><strong><span style="font-size:16.0pt;line-height:10%;font-family:宋体;">周刊</span></strong><strong> </strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">W?/span></strong><strong><span style="font-size:16.0pt;line-height:10%"> 168 </span></strong><strong><span style="font-size:16.0pt;line-height: 10%;font-family:宋体;">?/span></strong><strong></strong></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">启明星辰q_和大数据整体l编?/span></p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"> </p> <p align="left" style="line-height: 10%;"><span style="font-size:14.0pt;line-height:10%">2016</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">q?/span><span style="font-size:14.0pt;line-height:10%">5</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span><span style="font-size:14.0pt;line-height:10%">1</span><span style="font-size:14.0pt;line-height:10%;font-family:宋体;">?/span></p> <p> </p> <p><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">C本周在旧金山召开Q不容置疑本周期刊将有大量的</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">内容。除此以外,q有大量关于</span><span style="font-family:Helvetica;">Impala</span><span style="font-family:宋体;">性能?/span><span style="font-family:Helvetica;">Kudu</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Druid</span><span style="font-family:宋体;">斚w的文章。在其他新闻部分Q?/span><span style="font-family:Helvetica;">Apache Apex</span><span style="font-family:宋体;">成ؓ?/span><span style="font-family:Helvetica;">Apache</span><span style="font-family:宋体;">的顶U项目,</span><span style="font-family:Helvetica;">Qubole</span><span style="font-family:宋体;">开源了?/span><span style="font-family:Helvetica;">StreamX</span><span style="font-family:宋体;">目?/span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">技术新?/span></strong><strong></strong></p> <p><span style="font-family: 宋体;">本文快速浏览了如何在可能或不可能创建新数据分区的情况下操作</span><span style="font-family:Helvetica;">Spark RDD</span><span style="font-family:宋体;">。尤?/span><span style="font-family:Helvetica;">`mapValues`</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">`filter`</span><span style="font-family:宋体;">会保存分?/span><span style="font-family:Helvetica;">`map`</span><span style="font-family:宋体;">却不会?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://medium.com/@corentinanjuna/apache-spark-rdd-partitioning-preservation-2187a93bc33e</span></p> <p><span style="position: relative;z-index:251659264"><span style="left:0px;position:absolute;left:-10px; top:-133px;width:433px;height:19px"><img width="433" height="19" src="file://localhost/Users/jiangrongsheng/Library/Group%20Containers/UBF8T346G9.Office/msoclip1/01/clip_image001.png" v:shapes="直线q接W_x0020_1" alt="" /></span></span> </p> <br clear="ALL" /> <p><span style="font-family:宋体;">本文介绍了如何?/span><span style="font-family:Helvetica;">Conda</span><span style="font-family:宋体;">构徏独立?/span><span style="font-family:Helvetica;">Python</span><span style="font-family:宋体;">环境Q例?/span><span style="font-family:Helvetica;">pandas</span><span style="font-family:宋体;">插gQ,以便做ؓ</span><span style="font-family:Helvetica;">Spark job</span><span style="font-family:宋体;">的一部分装蝲到集节炏V经q这L处理Q就能在没有</span><span style="font-family:Helvetica;">python</span><span style="font-family:宋体;">原生包被安装在主操作pȝ上的情况下运?/span><span style="font-family:Helvetica;">PySpark job</span><span style="font-family:宋体;">。这U方案同样适用?/span><span style="font-family:Helvetica;">SparkR</span><span style="font-family:宋体;">?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://quasiben.github.io/blog/2016/4/15/conda-spark/</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">Datadog</span><span style="font-family:宋体;">博客有三监?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">的系列文章。第一详l概括了</span><span style="font-family:Helvetica;">broker</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">producer</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">consumers</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">ZooKeeper</span><span style="font-family:宋体;">的关键度量指标。第二篇介绍了怎样?/span><span style="font-family:Helvetica;">JConsole</span><span style="font-family:宋体;">和其他工具上通过</span><span style="font-family:Helvetica;">JMX</span><span style="font-family:宋体;">查看指标Q第三篇介绍?/span><span style="font-family: Helvetica;">Datadog</span><span style="font-family:宋体;">集成斚w的知识?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://www.datadoghq.com/blog/monitoring-kafka-performance-metrics/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Salesforce</span><span style="font-family:宋体;">撰文介绍?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">在他们组l内的成长史。最初,他们借助</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">驱动了操作指标分析功能,渐渐地成Z个驱动众多系l的大^台?/span><span style="font-family: Helvetica;">Salesforce</span><span style="font-family:宋体;">q用</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">在多个数据中心运行,q?/span><span style="font-family:Helvetica;">MirrorMaker</span><span style="font-family:宋体;">在集间复制和聚合数据?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://medium.com/salesforce-engineering/expanding-visibility-with-apache-kafka-e305b12c4aba#.5k7j921o3</span></p> <p> </p> <p><span style="font-family:Helvetica;">Metamarkets</span><span style="font-family: 宋体;">博客有一关于优化大规模分布式系l的有趣博文?/span><span style="font-family:Helvetica;">Druid</span><span style="font-family:宋体;">Q他们的分布式数据仓库,最q增加了一U?/span><span style="font-family:Helvetica;">"</span><span style="font-family:宋体;">先进先出</span><span style="font-family:Helvetica;">"</span><span style="font-family:宋体;">的查询模式,q在重型负蝲大集间q行了测试。根据他们的假设Q推Q何可能发生和攉到有的的指标?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://metamarkets.com/2016/impact-on-query-speed-from-forced-processing-ordering-in-druid/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Google Cloud Big Data</span><span style="font-family:宋体;">博客撰文介绍?/span><span style="font-family:Helvetica;">BigQuery</span><span style="font-family:宋体;">的内部存储格式,容器Q以及其它得存储数据更有效率的优化措施?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://cloud.google.com/blog/big-data/2016/04/inside-capacitor-bigquerys-next-generation-columnar-storage-format</span></p> <p> </p> <p><span style="font-family:Helvetica;">Apache Kudu</span><span style="font-family: 宋体;">Q孵化中Q博客概qC最q?/span><span style="font-family: Helvetica;">YCSB</span><span style="font-family:宋体;">工具对系l性能分析和调优的l果?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://getkudu.io/2016/04/26/ycsb.html</span></p> <p> </p> <p><span style="font-family:Helvetica;">Impala 2.5</span><span style="font-family:宋体;">无论?/span><span style="font-family:Helvetica;">TPC</span><span style="font-family:宋体;">基准试q是其它斚w均有显著的性能提升。提升项包括q行时过滤器Q?/span><span style="font-family:Helvetica;">LLVM</span><span style="font-family:宋体;">代码生成器对</span><span style="font-family:Helvetica;">`SORT`</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">`DECIMAL`</span><span style="font-family:宋体;">的支持,更快?/span><span style="font-family:Helvetica;">metadata-only</span><span style="font-family:宋体;">查询Q等{?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blog.cloudera.com/blog/2016/04/apache-impala-incubating-in-cdh-5-7-4x-faster-for-bi-workloads-on-apache-hadoop/</span></p> <p> </p> <p><span style="font-family:宋体;">本文介绍了,为支持高可用性,如何?/span><span style="font-family:Helvetica;">Hive Metastore</span><span style="font-family:宋体;">配置</span><span style="font-family:Helvetica;">MariaDB</span><span style="font-family:宋体;">的?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://developer.ibm.com/hadoop/blog/2016/04/26/bigsql-ha-configure-ha-hive-metastore-db-using-mariadb10-1/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Altiscale</span><span style="font-family:宋体;">博客撰文介绍了寻?/span><span style="font-family:Helvetica;">NodeGroup</span><span style="font-family:宋体;">相关</span><span style="font-family:Helvetica;">bug</span><span style="font-family:宋体;">的过E(跟进三月的文章)。如果你因没扑ֈ</span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">Q或其他分布式系l)?/span><span style="font-family:Helvetica;">bug</span><span style="font-family:宋体;">根结而气馁,不要Ҏ。本文告诉你q的困难,甚至需要程序员在销?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">服务的企业干zL能搞定?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://www.altiscale.com/blog/part-1-2-investigation-analysis-and-resolution-of-nodegroup-performance-issues-on-bare-metal-hardware-clusters/</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">Netflix</span><span style="font-family:宋体;">现在q行了超q?/span><span style="font-family:Helvetica;">4000</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kafka </span><span style="font-family:Helvetica;">broker</span><span style="font-family:宋体;">Q横?/span><span style="font-family:Helvetica;">36</span><span style="font-family:宋体;">个集。在云中q行</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">需要一些权衡,团队q了开销和数据丢失(日数据丢失小?/span><span style="font-family:Helvetica;">0.01%</span><span style="font-family:宋体;">Q。本文分享了团队?/span><span style="font-family:Helvetica;">AWS</span><span style="font-family:宋体;">中运?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">的经验,主要是一些典型问题,部v{略Q小集群、隔ȝ</span><span style="font-family:Helvetica;">zookeeper</span><span style="font-family:宋体;">集群Q,集群U容错,支持</span><span style="font-family:Helvetica;">AWS availability zones</span><span style="font-family: 宋体;">Q?/span><span style="font-family:Helvetica;">Kafka UI</span><span style="font-family:宋体;">可视化等{?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://techblog.netflix.com/2016/04/kafka-inside-keystone-pipeline.html</span></a></p> <p align="left"> </p> <p><span style="font-family:Helvetica;">Amazon</span><span style="font-family:宋体;">大数据博客撰文介l了如何?/span><span style="font-family: Helvetica;">Amazon EMR</span><span style="font-family:宋体;">加密数据存放?/span><span style="font-family:Helvetica;">S3</span><span style="font-family:宋体;">中。这U集成方式同时支持客L和服务器端加密(借助?/span><span style="font-family:Helvetica;">Amazon KMS</span><span style="font-family:宋体;">Q?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">http://blogs.aws.amazon.com/bigdata/post/TxBQTAF 3X7VLEP/Process-Encrypted-Data-in-Amazon-EMR-with-Amazon-S3-and-AWS-KMS</span></p> <p> </p> <p><span style="font-family:Helvetica;">TubeMogul</span><span style="font-family:宋体;">介绍了他们大数据q_的历Ԍ该^台每月支撑万亿次数据分析h。该团队很早p?/span><span style="font-family:Helvetica;">Amazon EMR</span><span style="font-family:宋体;">Q导入了</span><span style="font-family:Helvetica;">Storm</span><span style="font-family:宋体;">实时处理技术,最l把大数据服务落在了</span><span style="font-family:Helvetica;">Qubole</span><span style="font-family:宋体;">上?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://www.tubemogul.com/engineering/the-big-data-lifecycle-at-tubemogul/</span></p> <p> </p> <p><span style="font-family:Helvetica;">Caffe</span><span style="font-family:宋体;">Q深度学习框Ӟ?/span><span style="font-family:Helvetica;">Spark</span><span style="font-family:宋体;">q行了集?/span><span style="font-family:Helvetica;">—CaffeOnSpark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">公司撰文介绍了如何在</span><span style="font-family:Helvetica;">MapR YARN</span><span style="font-family:宋体;">上运行,文章q包括了采用的性能优化手段?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://www.mapr.com/blog/distributed-deep-learning-caffe-using-mapr-cluster</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">其他新闻</span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">Apache Apex</span><span style="font-family: 宋体;">Q大数据式处理和批处理pȝQ现在成Z</span><span style="font-family:Helvetica;">Apache</span><span style="font-family:宋体;">软g基金会的目?/span><span style="font-family:Helvetica;">Apex</span><span style="font-family:宋体;">d</span><span style="font-family:Helvetica;">8</span><span style="font-family:宋体;">月进入孵化器?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://blogs.apache.org/foundation/entry/the_apache_ software_foundation_announces90</span></p> <p> </p> <p><span style="font-family:Helvetica;">Heroku Kafka</span><span style="font-family: 宋体;">Q是一个分支于</span><span style="font-family:Helvetica;">Heroku</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">理服务。最q接q发?/span><span style="font-family:Helvetica;">beta</span><span style="font-family:宋体;">版?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://blog.heroku.com/archives/2016/4/26/announcing-heroku-kafka-early-access</span></p> <p> </p> <p><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">博客上的一文章强调ؓ什么性别多样性是重要的,q提C大数据论坛中的女性,本文旨在鼓励x投w于q一领域?/span><span style="font-family:Helvetica;">“</span><span style="font-family:宋体;">大数据论坛中的女?/span><span style="font-family:Helvetica;">”</span><span style="font-family:宋体;">研讨会本周由</span><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">l织在圣何塞召开?/span></p> <p><span style="font-family:Helvetica;color:#386EFF;">https://www.mapr.com/blog/case-women-big-data</span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">产品发布</span></strong><strong></strong></p> <p align="left"><span style="font-family:Helvetica;">StreamX</span><span style="font-family:宋体;">是一个来?/span><span style="font-family:Helvetica;">Qubole</span><span style="font-family:宋体;">的开源项目,它能?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">拯数据?/span><span style="font-family:Helvetica;">Amazon S3</span><span style="font-family:宋体;">q样的目标存储中?/span><span style="font-family:Helvetica;">Qubole</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">StreamX</span><span style="font-family:宋体;">作ؓ一U管理服务提供?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://www.qubole.com/blog/big-data/streamx/</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">SnappyData</span><span style="font-family:宋体;">是一个ؓ</span><span style="font-family:Helvetica;">OLAP</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">OLTP</span><span style="font-family:宋体;">查询式数据的新q_Q和公司Q?/span><span style="font-family:Helvetica;">SnappyData</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Spark</span><span style="font-family: 宋体;">?/span><span style="font-family:Helvetica;">GemFire</span><span style="font-family:宋体;">的内存存储技术驱动?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">http://www.infoworld.com/article/3062022/sql/apache-spark-powers-live-sql-analytics-in-snappydata.html</span></a></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration: none;text-underline:none">http://www.snappydata.io/</span></a></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Geode</span><span style="font-family:宋体;">Q孵化中Q发布了</span><span style="font-family:Helvetica;">1.0.0-incubating.M2</span><span style="font-family:宋体;">版本Q它是一个分布式数据q_Q瞄准高性能和低延迟。新版本提供了广域网下的点对点连接等新特性?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://mail-archives.apache.org/mod_mbox/incubator-geode-dev/201604.mbox/%3CCAFh%2B7k2eiK2TMGK sLqrY9CZDjxjYwiuTQ4QGUVC2s3geyJYwnA% 40mail.gmail.com%3E</span></p> <p align="left"> </p> <p align="left"><span style="font-family:Helvetica;">Apache Knox</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">0.9.0</span><span style="font-family:宋体;">版,它是</span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">REST API</span><span style="font-family:宋体;">|关。新版本?/span><span style="font-family:Helvetica;">Ranger</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Ambari</span><span style="font-family:宋体;">提供?/span><span style="font-family:Helvetica;">UI</span><span style="font-family:宋体;">界面支持Q以及一些其它的提升?/span><span style="font-family:Helvetica;">bug</span><span style="font-family:宋体;">修复?/span></p> <p align="left"><span style="font-family:Helvetica; color:#386EFF;">http://mail-archives.us.apache.org/mod_mbox/www-announce/201604.mbox/%3CCACRbFyjRF7zShb-NQ29d3FJ0hKZ57ts0Qfo31ffuNODpskwqPQ @mail.gmail.com%3E</span></p> <p align="left"> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">zd</span></strong><strong></strong></p> <p align="left"><span style="font-size:14.0pt;font-family:SimSun;">中国</span></p> <p align="left"><span style="font-family:SimSun;">?/span></p><img src ="http://m.tkk7.com/rosen/aggbug/430401.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2016-05-07 23:37 <a href="http://m.tkk7.com/rosen/archive/2016/05/07/430401.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>Hadoop周刊—第 167 ?http://m.tkk7.com/rosen/archive/2016/05/03/430325.htmlRosenRosenTue, 03 May 2016 02:08:00 GMThttp://m.tkk7.com/rosen/archive/2016/05/03/430325.htmlhttp://m.tkk7.com/rosen/comments/430325.htmlhttp://m.tkk7.com/rosen/archive/2016/05/03/430325.html#Feedback0http://m.tkk7.com/rosen/comments/commentRss/430325.htmlhttp://m.tkk7.com/rosen/services/trackbacks/430325.htmlHadoop周刊 W?/span> 167 ?br />



启明星辰q_和大数据整体l编?/span>



2016
q?/span>4?/span>25?/span>

 

Ƣ迎来到Hadoop周刊周一特别版。本周有大量来自Spark?/span>Kafka?/span>Beam?/span>Kudu的技术新闅R如果你正在L一些更前沿的技术,Apache MetronQ孵化中Q发布了它们W一个版本?/span>MetronQ是一个构建在Hadoop上正在不断发展的通用安全pȝ?/span>

 

技术新?/span>

本文介绍了如何在AWS上构建流式处理系l。包括了诸如Amazon Kinesis ?/span>AWS Lambda?/span>Kineses S3 connector之类单的搭配ҎQ也介绍?/span>AWS实现实时分析场景q样相对复杂点的Ҏ?/span>

http://cdn.oreillystatic.com/en/assets/1/event/144/Building%20a%20scalable%20architecture%20for%20processing%20streaming%20data%20on%20AWS%20Presentation.pdf

 

本文介绍了怎样使用Spark Testing Base?/span>Spark Testing Base是一个用Scala~写Q通过Java调用?/span>Spark试框架。本文的样例代码展示了如何隔L试逻辑重构Spark代码Q同时还通过Java处理了一些臃肿的Scala API?/span>

http://www.jesse-anderson.com/2016/04/unit-testing-spark-with-java/

 

Altiscale博客概述了在Spark环境下,构徏thin?/span>uber jar包的优劣。示范了?/span>Maven?/span>SBT分别构徏两种包的情况?/span>

https://www.altiscale.com/blog/spark-on-hadoop-thin-jars/

 

LinkedIn介绍了他们的Kafka生态系l,生态系l包含一个特D的Kafka producerQ一个ؓ?/span>Java客户端提供的REST APIQ一?/span>avro模式注册表,以及GobblinQ装载数据到Hadoop的工P{等?/span>

https://engineering.linkedin.com/blog/2016/04/kafka-ecosystem-at-linkedin

 

?/span>Spark Streaming教程介绍了怎样通过twitter4j API拉推文,Z标签qoQ对推文q行情感分析?/span>

https://www.mapr.com/blog/spark-streaming-and-twitter-sentiment-analysis

 

Apache KuduQ孵化中Q是Apache ImpalaQ孵化中Q的l佳伴GQ因为它能高效地解决q泛的分析和有针Ҏ的查询。本文描qC两者集成的技术细节,例如Kudu的设计如何保证高效地查询能力Q如何通过Impala?/span>Kudu执行写/更新Q删除操作等{?/span>

http://blog.cloudera.com/blog/2016/04/how-to-use-impala-and-kudu-together-for-analytic-workloads/

 

MapR撰文介绍了?/span>spark-sklearn扩展一个已存在?/span>scikit-learn模型。文章介l了如何透过Airbnb数据集内部徏模,q介l了如何傍着spark-sklearnq行交叉验证?/span>

https://www.mapr.com/blog/predicting-airbnb-listing-prices-scikit-learn-and-apache-spark

 

AWS大数据博客写了个如何?/span>Amazon EMR中?/span>HBase?/span>Hive的教E。本教程介绍?/span>HBaseQ描qC如何?/span>S3中恢?/span>HBase表,C?/span>Hive?/span>HBase如何集成{等?/span>

http://blogs.aws.amazon.com/bigdata/post/Tx3EGE8Z90LZ9WX/Combine-NoSQL-and-Massively-Parallel-Analytics-Using-Apache-HBase-and-Apache-Hiv

 

本文描述了ؓ学生在大数据评上提供实战经验的挑战。作者经历若q次的P代和选择g有了一个好Ҏ Altiscale?/span>Hadoop-as-a-Service?/span>

https://www.altiscale.com/blog/hadoop-as-a-service-in-the-classroom/

 

Cloudera博客的一客做文章,作者比较了Parquet?/span>Avro在跨两个数据集的不同处理方式Q一个数据集H?/span>(3?/span>)、一个数据集?/span>(103?/span>)Q。在?/span>Spark?/span>Spark SQL试查询Q操作后Q作者发?/span>Parquet?/span>Avro在查询序列化数据斚w有时表现很类|管在大多数情况下查?/span>Parquet数据的时候更快点Q序列化数据更小Q?/span>

http://blog.cloudera.com/blog/2016/04/benchmarking-apache-parquet-the-allstate-experience/

 

本文介绍了如何在CDHq样的分布式环境中?/span>SparkRQ尽?/span>SparkR官方q没有支持这U方式。借助YARN?/span>worker本地安装R语言包,jobE加攚w就能执行了?/span>

http://www.nodalpoint.com/sparkr-in-cloudera-hadoop/

 

很多开源框枉能执?/span>MapReduce以及借助更高U的~程模型完成cM的工作。纵观过去,它们依赖独立q行的框Ӟ例如MapReduce, StormQ,但是最q的某些变化使得q一切充满了变数?/span>Apache BeamQ孵化中Q更q一步地跨越了批处理、流式处理两U执行模式,内置更加复杂的计模型?/span>

http://www.datanami.com/2016/04/22/apache-beam-emerges-ambitious-goal-unify-big-data-development/

 

Apache博客发布?/span>HBase?/span>HDD?/span>SSD以及RAMDISK上的写入性能试比对?/span>7系列文章。通过q一分析Q作者发现ƈ提议?/span>HBase?/span>HDFS上实C些未覆盖的功能?/span>

https://blogs.apache.org/hbase/entry/hdfs_hsm_and_hbase_part

 

其他新闻

Tom WhiteQ?/span>“Hadoop权威指南的作者撰文介l他是如何步?/span>Apache HadoopD堂的。他的早期A献是l着Hadoop?/span>Amazon Web Services集成展开Q而今AWS已成?/span>Hadoop目成功的重要部分?/span>

http://vision.cloudera.com/how-i-got-into-hadoop/

 

FluoApache Accumulo准备的分布式处理引擎Q向Apache孵化器提交了孵化甌?/span>

https://wiki.apache.org/incubator/FluoProposal

 

Apache Phoenix宣布在HBaseCon后D行会议,Apache Phoenix是一?/span>SQL-on-HBasepȝ。该会议只有半天Q主题是介绍Phoenix内部情况和用例?/span>

http://hortonworks.com/blog/announcing-first-annual-phoenixcon-apache-phoenix-user-conference/

 

产品发布

Apache MetronQ构ZHadoop上的安全框架Q发布了0.1版?/span>Hortonworks支撑其作为技术预览版Qƈ撰写本文介绍了如何上手,如何贡献Q如何?/span>Metron UI{等?/span>

http://hortonworks.com/blog/apache-metron-tech-preview-1-come-get/

http://hortonworks.com/blog/apache-metron-use-case-finding-needle-haystack/

 

Apache NiFi本周发布?/span>0.6.1版。这是修复了10多个bug后的修复版?/span>

http://mail-archives.us.apache.org/mod_mbox/www-announce/201604.mbox/%3CCALJK9a7yLnFeJ7Z=eU6mOB-DXvo8MHUr=_RshSjZcTbTcAHDZA@mail.gmail.com%3E

 

Apache Flink本周发布?/span>1.0.2版。本ơ发布包括了bug修复Q?/span>RocksDB环境下的性能提升以及一些文档方面的q步?/span>

http://flink.apache.org/news/2016/04/22/release-1.0.2.html

 

Amazon发布了新?/span>Amazon EMRQ开始支?/span>HBase 1.2?/span>

https://aws.amazon.com/blogs/aws/amazon-emr-update-apache-hbase-1-2-is-now-available/

 

zd

中国

?/span>



Rosen 2016-05-03 10:08 发表评论
]]>
Hadoop周刊—第 166 ?/title><link>http://m.tkk7.com/rosen/archive/2016/04/21/430176.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Thu, 21 Apr 2016 07:07:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2016/04/21/430176.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/430176.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2016/04/21/430176.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/430176.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/430176.html</trackback:ping><description><![CDATA[<p><strong><span style="font-size:16.0pt">Hadoop</span></strong><strong><span style="font-size:16.0pt;font-family: 宋体;">周刊</span></strong><strong> </strong><strong><span style="font-size:16.0pt;font-family:宋体;">W?/span></strong><strong><span style="font-size:16.0pt"> 166 </span></strong><strong><span style="font-size: 16.0pt;font-family:宋体;">?/span></strong><strong></strong></p> <p><span style="font-size:14.0pt">2016</span><span style="font-size:14.0pt;font-family:宋体;">q?/span><span style="font-size:14.0pt">4</span><span style="font-size:14.0pt; font-family:宋体;">?/span><span style="font-size:14.0pt">17</span><span style="font-size:14.0pt;font-family: 宋体;">?/span></p> <p>启明星辰——q_和大数据整体l编?nbsp;<br /><br /></p> <p><span style="font-family:Helvetica;">Hortonworks</span><span style="font-family: 宋体;">在本?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">Ƨ洲C上有若干爆料Q诏I了本期整个内容。伴随着骄h的新Ҏ,</span><span style="font-family:Helvetica;">Apache Storm</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">1.0.0</span><span style="font-family:宋体;">版。在技术新L面,有不基?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">构徏大规模服务和分布式系l测试的文章。如果你错过?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">CQ那么不用担心,演讲视频已经攑ֈ了网上?/span></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">技术新?/span></strong><strong></strong></p> <p> </p> <p><span style="font-family:Helvetica;">Smyte</span><span style="font-family:宋体;">撰文介绍了他们基于事件数据流实时垃N件和诈骗信息的基设施。最初的事g处理pȝ构徏?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Redis</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Secor</span><span style="font-family:宋体;">以及</span><span style="font-family:Helvetica;">S3</span><span style="font-family:宋体;">上,Z满规模不断扩张和廉L要求Q他们把pȝq移到基于磁盘的Ҏ上,使用</span><span style="font-family:Helvetica;">Redis</span><span style="font-family:宋体;">协议?/span><span style="font-family:Helvetica;">RocksDB</span><span style="font-family:宋体;">交互Q?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">q行复制?/span></p> <p><a ><span style="font-family:Helvetica;">https://medium.com/the-smyte-blog/counting-with-domain-specific-databases-73c660472da</span></a></p> <p><u> </u></p> <p><span style="font-family:宋体;">本文?/span><span style="font-family:Helvetica;">rsyslog</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">AWS </span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">ELK</span><span style="font-family:宋体;">栈(</span><span style="font-family:Helvetica;">ElasticSearch</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Logstash</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Kibana</span><span style="font-family:宋体;">Q结合,处理诸如反压、规模以及维护方面的问题。本文覆盖了</span><span style="font-family:Helvetica;">rsyslog</span><span style="font-family:宋体;">集成</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">以及</span><span style="font-family:Helvetica;">schema</span><span style="font-family:宋体;">斚w的技巧,也介l了如何q行</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Zookeeper</span><span style="font-family:宋体;">以及</span><span style="font-family:Helvetica;">AWS</span><span style="font-family:宋体;">中大规模自动分组?/span></p> <p><a ><span style="font-family: Helvetica;">https://www.bashton.com/blog/2016/elk-on-ark/</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">Hortonworks</span><span style="font-family: 宋体;">撰文介绍?/span><span style="font-family:Helvetica;">Apache Atlas</span><span style="font-family:宋体;">以及</span><span style="font-family:Helvetica;">Apache Range</span><span style="font-family:宋体;">要引入的数据管理特性。这些特性是Q分c访问控制、数据有效期{略、位|特性策略、禁止数据集l合、跨lg家族Q例如从</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Storm</span><span style="font-family:宋体;">再到</span><span style="font-family:Helvetica;">Hive</span><span style="font-family:宋体;">的数据跟t)?/span></p> <p><a ><span style="font-family:Helvetica;">http://hortonworks.com/blog/the-next-generation-of-hadoop-based-security-data-governance/</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">Apache HAWQ </span><span style="font-family: 宋体;">Q孵化中Q是一个基?/span><span style="font-family: Helvetica;">Greenplum</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">HDFS</span><span style="font-family:宋体;">上提供数据查询的</span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">引擎。本文讨Z其典型设计以及新版本的诸多改q。包括它?/span><span style="font-family: Helvetica;">Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">MapReduce</span><span style="font-family:宋体;">的区别,q有?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">挑战l典</span><span style="font-family:Helvetica;">MPP</span><span style="font-family:宋体;">设计的内容,以及</span><span style="font-family:Helvetica;">HAWQ</span><span style="font-family:宋体;">的新设计怎样l合</span><span style="font-family:Helvetica;">MPP</span><span style="font-family:宋体;">和批处理技术进而其两者兼?/span></p> <p><a ><span style="font-family:Helvetica;">https://blog.pivotal.io/big-data-pivotal/products/apache-hawq-next-step-in-massively-parallel-processing</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">Cloudera</span><span style="font-family:宋体;">博客撰文介绍了对</span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">分布式系l进行故障注入、组|的试工具</span><span style="font-family:Helvetica;">AgenTEST</span><span style="font-family:宋体;">。它能注入网l故障(例如丢包Q,资源满蝲Q例?/span><span style="font-family:Helvetica;">CPU</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">IO</span><span style="font-family:宋体;">、磁盘空_{等。当试|络分区Ӟ可以评估环Şl网、桥接组|等{?/span></p> <p><a ><span style="font-family:Helvetica;">http://blog.cloudera.com/blog/2016/04/quality-assurance-at-cloudera-fault-injection-and-elastic-partitioning/</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">Hortonworks</span><span style="font-family: 宋体;">博客展望了将包含新版?/span><span style="font-family: Helvetica;">Spark</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Zeppelin</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">HDP 2.4.2</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Spark2.0</span><span style="font-family:宋体;">预览版和</span><span style="font-family:Helvetica;">Zeppelin</span><span style="font-family:宋体;">新特性都包含在内?/span></p> <p><a ><span style="font-family:Helvetica;">http://hortonworks.com/blog/apache-spark-apache-zeppelin-whats-coming-in-hdp-2-4-2/</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">Cask</span><span style="font-family:宋体;">撰文介绍了在</span><span style="font-family:Helvetica;">Hbase region compaction</span><span style="font-family:宋体;">q样|见事g发生的前后,他们是怎样通过长时间测试以评估分布式系l正性的?/span></p> <p><a ><span style="font-family:Helvetica;">http://blog.cask.co/2016/04/long-running-tests-in-cdap/</span></a></p> <p><u> </u></p> <p><span style="font-family:宋体;">本文介绍了如何结?/span><span style="font-family:Helvetica;">SparkR</span><span style="font-family:宋体;">与亚马?/span><span style="font-family:Helvetica;">EMR</span><span style="font-family:宋体;">q行地理I间分析的。通过</span><span style="font-family: Helvetica;">SparkR</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hive</span><span style="font-family:宋体;">集成lgQ可以立d?/span><span style="font-family:Helvetica;">S3</span><span style="font-family:宋体;">上的数据映射</span><span style="font-family:Helvetica;">Hive</span><span style="font-family:宋体;">外部表。从q开始,数据p直接加蝲到内存中使用</span><span style="font-family:Helvetica;">R</span><span style="font-family:宋体;">语言分析Q很Ҏ实现高质量的数据可视化?/span></p> <p><a ><span style="font-family:Helvetica;">http://blogs.aws.amazon.com/bigdata/post/Tx1MECZ47VAV84F/Exploring-Geospatial-Intelligence-using-SparkR-on-Amazon-EMR</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">~写了?/span><span style="font-family:Helvetica;">Pig</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hive</span><span style="font-family:宋体;">分析职业球大联盟球队水q的教程?/span><span style="font-family:Helvetica;">Pig</span><span style="font-family:宋体;">用于数据初加工,</span><span style="font-family:Helvetica;">Hive</span><span style="font-family:宋体;">提供Z</span><span style="font-family:Helvetica;">SQL</span><span style="font-family:宋体;">的数据查询环境。借助</span><span style="font-family:Helvetica;">Hive ODBC</span><span style="font-family:宋体;">驱动?/span><span style="font-family:Helvetica;">Hive</span><span style="font-family:宋体;">服务器,使得微Y</span><span style="font-family:Helvetica;">Excel</span><span style="font-family:宋体;">也能用于获取和分析数据?/span></p> <p><a ><span style="font-family:Helvetica;">https://www.mapr.com/blog/using-hive-and-pig-baseball-statistics</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">SignalFX</span><span style="font-family:宋体;">通过</span><span style="font-family:Helvetica;">27</span><span style="font-family:宋体;">节点?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">集群每天处理</span><span style="font-family:Helvetica;">700</span><span style="font-family:宋体;">多亿条消息。只有基于他们积累的大规?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">使用l验才能有如此高的量Q因此他们共享了不少调试</span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">的技巧,定位告警Q例如日志刷新gq增加)Q以?/span><span style="font-family:Helvetica;">Kafka</span><span style="font-family:宋体;">横向扩展?/span></p> <p><a ><span style="font-family:Helvetica;">http://www.confluent.io/blog/how-we-monitor-and-run-kafka-at-scale-signalfx</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">dataArtisan's</span><span style="font-family: 宋体;">博客Z度量</span><span style="font-family:Helvetica;">Flink</span><span style="font-family:宋体;">在数据流效率、低延迟、正性上的能力,专门写了q篇文章。ؓ了证明效率,在高吞吐量的环境下运行了最新的</span><span style="font-family:Helvetica;">Yahoo!</span><span style="font-family:宋体;">式基准试E序。在正确性方面,文章H出?/span><span style="font-family:Helvetica;">Flink</span><span style="font-family:宋体;">事g判别和处理事Ӟ星球大战电媄q表做类比)斚w的优ѝ最后,文章描述?/span><span style="font-family:Helvetica;">Flink</span><span style="font-family:宋体;">未来版本Z内存的查询Q务?/span></p> <p><a ><span style="font-family:Helvetica;">http://data-artisans.com/counting-in-streams-a-hierarchy-of-needs/</span></a></p> <p><strong> </strong></p> <p><span style="font-family:宋体;">本教E介l了怎样?/span>TCP Socket<span style="font-family:宋体;">中的文本数据{换ؓ</span>Spark<span style="font-family:宋体;">式数据源?/span></p> <p align="left"><a ><span style="font-family:Helvetica;color:#386EFF;text-decoration:none;text-underline:none">https://medium.com/@anicolaspp/spark-custom-streaming-sources-e7d52da72e80</span></a></p> <p> </p> <p><span style="font-family:宋体;">本文介绍了在构徏</span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">的时候怎样防止</span><span style="font-family:Helvetica;">AWS</span><span style="font-family:宋体;">证书</span><span style="font-family:宋体;">意外提交到补丁或</span><span style="font-family:Helvetica;">git</span><span style="font-family:宋体;">资源库。除</span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">本n外,本文q徏议?/span><span style="font-family: Helvetica;">“git-secrets”</span><span style="font-family:宋体;">工具防止意外提交讉K</span><span style="font-family:Helvetica;">/</span><span style="font-family:宋体;">安全密钥。如果你用的?/span><span style="font-family:Helvetica;">Hadoop S3</span><span style="font-family:宋体;">Q还推荐了新补丁供评估?/span></p> <p><a ><span style="font-family:Helvetica;">http://steveloughran.blogspot.co.uk/2016/04/testing-against-s3-and-object-stores.html</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">Big Data & Brews</span><span style="font-family:宋体;">采访?/span><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Ted Dunning</span><span style="font-family: 宋体;">?/span><span style="font-family:Helvetica;">Jacques Nadeau</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Apache Arrow</span><span style="font-family:宋体;">也在本次采访范围内?/span></p> <p align="left"><a ><span style="font-family: Helvetica;color:#386EFF; text-decoration:none;text-underline:none">https://www.youtube.com/watch?v=l3mDDKjDjMk</span></a></p> <p align="left"><a ><span style="font-family: Helvetica;color:#386EFF; text-decoration:none;text-underline:none">https://www.youtube.com/watch?v=Xo9CO0a0VJI</span></a></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">其他新闻</span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">DataEngConf</span><span style="font-family: 宋体;">最q在旧金山召开。本文ȝ?/span><span style="font-family: Helvetica;">Uber</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Stripe</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Microsoft</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Instacart</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Jawbone</span><span style="font-family:宋体;">的发a内容。也介绍了会议主?/span><span style="font-family:Helvetica;">“</span><span style="font-family:宋体;">数据U学在现实世界中是一个品和工程学科</span><span style="font-family:Helvetica;">”</span><span style="font-family:宋体;">?/span></p> <p><a ><span style="font-family:Helvetica;">https://medium.com/@eugmandel/software-engineering-invades-data-science-notes-from-dataengconf-4a3c066b081f#.g2h0duo44</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">Hortonworks</span><span style="font-family: 宋体;">在上周都柏林举行?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">Ƨ洲C上大攑ּ彩?/span><span style="font-family:Helvetica;">ZDNet</span><span style="font-family:宋体;">报导了这些亮点,其中包括?/span><span style="font-family:Helvetica;">Pivotal</span><span style="font-family:宋体;">Q已转售l?/span><span style="font-family:Helvetica;">HDP</span><span style="font-family:宋体;">Q的扩展合作Q与</span><span style="font-family:Helvetica;">Syncosrt</span><span style="font-family:宋体;">的{售协议,以及</span><span style="font-family:Helvetica;">Atlas</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Ranger</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Zeppelin</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Metron</span><span style="font-family:宋体;">的技术预览。报D介绍?/span><span style="font-family: Helvetica;">Hortonworks</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Cloudera</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">MapR</span><span style="font-family:宋体;">产品的不同之处?/span></p> <p><a ><span style="font-family:Helvetica;">http://www.zdnet.com/article/hortonworks-announces-new-alliances-and-releases-hadoop-comes-to-fork-in-road/</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">Flink 2016</span><span style="font-family:宋体;">C在九月于d国柏林D行。讨题征集将于六月末l束?/span></p> <p><a ><span style="font-family:Helvetica;">http://flink.apache.org/news/2016/04/14/flink-forward-announce.html</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">YouTube</span><span style="font-family:宋体;">上发布了</span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">都柏林峰会演讲视频。正如预期的那样Q这些演讲内Ҏ?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">生态系l的各个部分?/span></p> <p><a >https://www.youtube.com/channel/UCAPa-K_rhylDZAUHVxqqsRA/videos?flow=list&live_view=500&view=0&sort=dd</a></p> <p> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">产品发布</span></strong><strong></strong></p> <p><span style="font-family:Helvetica;">Metascope</span><span style="font-family:宋体;">是一个配?/span><span style="font-family:Helvetica;">Schedoscope</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">集群中进行元数据理的新工具。通过</span><span style="font-family:Helvetica;">web</span><span style="font-family:宋体;">界面Q利用数据沿袭它能洞察大量的数据。也提供索、内嵌文?/span><span style="font-family: Helvetica;">REST API</span><span style="font-family:宋体;">{等功能?/span></p> <p><a ><span style="font-family:Helvetica;">https://github.com/ottogroup/metascope</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">Apache HBase 1.2.1</span><span style="font-family:宋体;">于本周发布,?/span><span style="font-family:Helvetica;">1.2.0</span><span style="font-family:宋体;">的基上解决了</span><span style="font-family:Helvetica;">27</span><span style="font-family:宋体;">个问题。发布声明中重点介绍了四个高优先U的问题?/span></p> <p><a ><span style="font-family:Helvetica;">http://mail-archives.us.apache.org/mod_mbox/www-announce/201604.mbox/%3CCAN5cbe7-T5uAYvGRbxw2dfvdbwe5s0nx3vKU8Nt2fzXbKPoQTg@mail.gmail.com%3E</span></a></p> <p> </p> <p><span style="font-family:Helvetica;">Apache Mahout</span><span style="font-family: 宋体;">机器学习库发布了</span><span style="font-family:Helvetica;">0.12.0</span><span style="font-family:宋体;">版。该版本?/span><span style="font-family:Helvetica;">“Samsara”</span><span style="font-family:宋体;">数学环境开始支?/span><span style="font-family:Helvetica;">Apache Flink</span><span style="font-family: 宋体;">了,q且是^台无关的。发布声明中分n了与</span><span style="font-family:Helvetica;">Flink</span><span style="font-family:宋体;">集成、已知问题、项目演q计划相关的内容?/span></p> <p><a ><span style="font-family:Helvetica;">http://mail-archives.us.apache.org/mod_mbox/www-announce/201604.mbox/%3CCAOtpBjj5An876PStdn5kMeaF+up-B72WTmCk9j21EXdP=JOCUA@mail.gmail.com%3E</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">Apache Storm 1.0.0</span><span style="font-family:宋体;">本周发布了。亮点包括性能提升Q普遍提?/span><span style="font-family:Helvetica;">3</span><span style="font-family:宋体;">倍以上)、新的分布式~存</span><span style="font-family:Helvetica;">API</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">nimbus</span><span style="font-family:宋体;">的高可用性、自动反压、动?/span><span style="font-family:Helvetica;">worker</span><span style="font-family:宋体;">性能分析{等?/span></p> <p><a ><span style="font-family:Helvetica;">http://storm.apache.org/2016/04/12/storm100-released.html</span></a></p> <p><u> </u></p> <p><span style="font-family:Helvetica;">Apache Kudu</span><span style="font-family: 宋体;">Q孵化中Q本周发布了</span><span style="font-family: Helvetica;">0.8.0</span><span style="font-family:宋体;">版。本ơ发布添加了</span><span style="font-family:Helvetica;">Apache Flume sink</span><span style="font-family:宋体;">、部分功能提升、修复了一?/span><span style="font-family: Helvetica;">bug</span><span style="font-family:宋体;">?/span></p> <p><a ><span style="font-family:Helvetica;">http://getkudu.io/releases/0.8.0/docs/release_notes.html</span></a></p> <p><u> </u></p> <p align="left"><span style="font-family:Helvetica;">Cloudbreak</span><span style="font-family:宋体;">本周发布?/span><span style="font-family:Helvetica;">1.2</span><span style="font-family:宋体;">版,它ؓ云环境提?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">集群</span><span style="font-family:Helvetica;">Docker</span><span style="font-family:宋体;">。新Ҏ包括支?/span><span style="font-family:Helvetica;">OpenStack</span><span style="font-family:宋体;">以及定义服务器提供配|脚本?/span></p> <p align="left"><a ><span style="font-family:Helvetica;">http://hortonworks.com/blog/announcing-cloudbreak-1-2/</span></a></p> <p align="left"><u> </u></p> <p align="left"><span style="font-family:Helvetica;">Cloudera</span><span style="font-family:宋体;">发布?/span><span style="font-family:Helvetica;">Cloudera Enterprise 5.4.10</span><span style="font-family:宋体;">Q内|了</span><span style="font-family:Helvetica;">Flume</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hadoop</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">HBase</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Hive</span><span style="font-family:宋体;">?/span><span style="font-family:Helvetica;">Impala</span><span style="font-family:宋体;">{组件?/span></p> <p align="left"><a ><span style="font-family:Helvetica;">http://community.cloudera.com/t5/Community-News-Release/ANNOUNCE-Cloudera-Enterprise-5-4-10-Released/m-p/39790#U39790</span></a></p> <p align="left"><u> </u></p> <p align="left"><span style="font-family:Helvetica;">Presto Accumulo</span><span style="font-family:宋体;">是个新项目,?/span><span style="font-family:Helvetica;">Accumulo</span><span style="font-family:宋体;">d数据提供?/span><span style="font-family:Helvetica;">Presto</span><span style="font-family:宋体;">q接器?/span></p> <p align="left"><a ><span style="font-family:Helvetica;">https://github.com/bloomberg/presto-accumulo</span></a></p> <p align="left"> </p> <p><strong><span style="font-size:15.0pt;font-family:宋体;">zd</span></strong><strong></strong></p> <p align="left"><span style="font-size:14.0pt;font-family:SimSun;">中国</span></p> <p align="left"><span style="font-family:SimSun;">?/span></p><img src ="http://m.tkk7.com/rosen/aggbug/430176.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2016-04-21 15:07 <a href="http://m.tkk7.com/rosen/archive/2016/04/21/430176.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>Hadoop周刊—第 165 ?/title><link>http://m.tkk7.com/rosen/archive/2016/04/14/430099.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Thu, 14 Apr 2016 10:02:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2016/04/14/430099.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/430099.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2016/04/14/430099.html#Feedback</comments><slash:comments>0</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/430099.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/430099.html</trackback:ping><description><![CDATA[<p><strong><span style="font-size:22.0pt;font-family:"Lantinghei SC Demibold"; color:#355400;">Hadoop</span><span style="font-size:22.0pt; font-family:"Lantinghei SC Demibold";color:#355400;">周刊</span></strong></p> <p><strong> </strong></p> <p><span style="font-size:14.0pt;font-family:"Lantinghei SC Demibold"; color:#355400;"><strong>W?165 ?2016q??0?</strong></span></p> <p><span style="font-size:10.5pt;font-family:"Lantinghei SC Demibold"; color:#355400;"><strong>启明星辰——q_和大数据整体l编?/strong></span></p> <p> </p> <p><span style="font-size:12.0pt;line-height:135%; font-family:宋体;Times New Roman";Times New Roman";">本周Q包?/span><span style="font-size:12.0pt;line-height:135%">LinkedIn </span><span style="font-size:12.0pt;line-height:135%;font-family:宋体;Times New Roman";Times New Roman";">?/span><span style="font-size: 12.0pt;line-height:135%">Airbnb</span><span style="font-size:12.0pt;line-height: 135%;font-family:宋体;Times New Roman";Times New Roman";">新开源项目在内的C产品q行了重大版本发布。本期技术部分与式处理有关</span><span style="font-size:12.0pt;line-height:135%">——Spark</span><span style="font-size:12.0pt;line-height:135%;font-family:宋体;Times New Roman";Times New Roman";">?/span><span style="font-size: 12.0pt;line-height:135%">Flink</span><span style="font-size:12.0pt;line-height: 135%;font-family:宋体;Times New Roman";Times New Roman";">?/span><span style="font-size:12.0pt;line-height:135%">Kafka</span><span style="font-size:12.0pt;line-height:135%;font-family:宋体;Times New Roman";Times New Roman";">{等Q新闻部分是关于</span><span style="font-size:12.0pt;line-height:135%">Spark Summit </span><span style="font-size:12.0pt;line-height:135%;font-family:宋体;Times New Roman";Times New Roman";">?/span><span style="font-size: 12.0pt;line-height:135%">HbaseCon</span><span style="font-size:12.0pt; line-height:135%;font-family:宋体;Times New Roman";Times New Roman";">的会议议E?/span></p> <h1><span style="font-family: 'Comic Sans MS'; font-size: 18pt;">技?/span></h1> <p><span style="font-size:10.5pt;">Zalando</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman";">发表了他们是如何选择</span><span style="font-size:10.5pt;">Apache Flink</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman";">作ؓ式处理框架的文章。该文章阐述了对评h标准q行验证后得出的l论Q阐明了选择</span><span style="font-size:10.5pt;">Apache Flink</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman";">的主?/span><span style="font-size:10.5pt;">—</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman";">在高吞吐量的情况下依然能保持低gq,真正的流式处理,开发h员支持?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">https://tech.zalando.com/blog/apache-showdown-flink-vs.-spark/</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Cloudera</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">博客刊登了来?/span><span style="font-size:10.5pt">Wargaming.net</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">的文章,通过本文可了解到他们如何通过</span><span style="font-size: 10.5pt">Kafka</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">HBase</span><span style="font-size:10.5pt;font-family: 宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Drools</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Spark</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">构徏实时处理基础设施的。另外,在数据流E方面,他们介绍了如何对</span><span style="font-size:10.5pt">HBase</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">的检索和序列化?/span><span style="font-size:10.5pt">HBase</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Spark</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">之间的数据本地化以及</span><span style="font-size:10.5pt">Spark</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">计算斚w的优化措施?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://blog.cloudera.com/blog/2016/04/inside-wargamings-data-driven-real-time-rules-engine/</span></a></p> <p> </p> <p><span style="font-size:10.5pt">InfoQ</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">发布了大规模式处理</span><span style="font-size:10.5pt">—SMACK</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q?/span><span style="font-size:10.5pt">Spark</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Mesos</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Akka</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Cassandra</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">以及</span><span style="font-size:10.5pt"> Kafka</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q栈的介l视频。讨ZZ?/span><span style="font-size:10.5pt">SMACK</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">栈在处理同样问题的时候比</span><span style="font-size:10.5pt">Lambda</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">架构更简单?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://www.infoq.com/presentations/stream-analytics-scalability</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Confluent“</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">日志压羃</span><span style="font-size:10.5pt">”</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">pd博文又有更新Q介l了</span><span style="font-size:10.5pt">Kafka</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">目三月份发生的事情。有不少令hx的开发内容,包括机架感知?/span><span style="font-size:10.5pt">Kerberos</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">支持、基于时间烦引方面的q展。以及不你Q我也是Q没有时间持l关注的最新研发成果?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://www.confluent.io/blog/log-compaction-highlights-in-the-kafka-and-stream-processing-community-april-2016</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Apache Flink 1.0</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">引入了新的复杂事件处理(</span><span style="font-size:10.5pt">CEP</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q库。啰嗦几句,</span><span style="font-size:10.5pt">CEP</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">提供了一U检事件模式的Ҏ。本文借助传感器从数据中心服务器上攉数据Q运用一U可能的异常用例,诠释?/span><span style="font-size:10.5pt">Flink</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">CEP</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">模式</span><span style="font-size:10.5pt">API </span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://flink.apache.org/news/2016/04/06/cep-monitoring.html</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Genome Analysis Toolkit </span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q?/span><span style="font-size:10.5pt">GATK</span><span style="font-size:10.5pt;font-family: 宋体;Times New Roman";Times New Roman"">Q最q宣布,下一个版本(当前?/span><span style="font-size:10.5pt">alpha</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">Q将支持</span><span style="font-size:10.5pt">Apache Spark</span><span style="font-size: 10.5pt;font-family:宋体;Times New Roman";Times New Roman"">。本文简要介l了工具ƈ展示了怎样通过</span><span style="font-size:10.5pt">Spark</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">来检重?/span><span style="font-size:10.5pt">DNA</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">片段的?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://blog.cloudera.com/blog/2016/04/genome-analysis-toolkit-now-using-apache-spark-for-data-processing/</span></a></p> <p> </p> <p><span style="font-size:10.5pt">InfoWorld</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">lD?/span><span style="font-size:10.5pt">Spark2.0</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">关于l构化流式处理方面的计划。微批处理将依然延箋Q还有些新特性,例如无限数据帧(</span><span style="font-size:10.5pt">Infinite DataFrames</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q、一的重复查询支持?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://www.infoworld.com/article/3052924/analytics/what-sparks-structured-streaming-really-means.html</span></a></p> <p> </p> <p><span style="font-size:10.5pt">AWS</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">大数据博客发布了一通过存储?/span><span style="font-size: 10.5pt">AWS Key Management Service </span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">Q?/span><span style="font-size:10.5pt">KMS</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">Q中的加密密钥加载数据到</span><span style="font-size:10.5pt">S3</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Redshift</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">的文章。除了描q所需步骤Q本文还介绍了如何在</span><span style="font-size:10.5pt">AWS S3</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">中通过</span><span style="font-size:10.5pt">KMS</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">密钥加密数据?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://blogs.aws.amazon.com/bigdata/post/Tx2Q3ZBOZO9DHVQ/Encrypt-Your-Amazon-Redshift-Loads-with-Amazon-S3-and-AWS-KMS</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Confluent</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">博客介绍了如何?/span><span style="font-size:10.5pt">Kafka Connect </span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt"> Kafka Streams </span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">~写非凡?/span><span style="font-size:10.5pt">“hello world”</span><span style="font-size: 10.5pt;font-family:宋体;Times New Roman";Times New Roman"">E序。更切地说Q范例程序从</span><span style="font-size:10.5pt">IRC</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">拉维基百U数据,q解析消息、进行多斚w的统计计。本文还用了若干E序展示了整个实现过E?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://www.confluent.io/blog/hello-world-kafka-connect-kafka-streams</span></a></p> <p> </p> <p style="line-height:107%"><span style="font-size:10.5pt; line-height:107%;font-family:宋体;Times New Roman";Times New Roman"">本文?/span><span style="font-size:10.5pt; line-height:107%">Postgres </span><span style="font-size:10.5pt;line-height: 107%;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt;line-height:107%"> Cassandra</span><span style="font-size:10.5pt;line-height:107%;font-family:宋体;Times New Roman";Times New Roman"">转换单的模式Q?/span><span style="font-size:10.5pt;line-height:107%">schemas</span><span style="font-size: 10.5pt;line-height:107%;font-family:宋体;Times New Roman";Times New Roman"">Q,q描qC主要的差?/span><span style="font-size:10.5pt; line-height:107%">—</span><span style="font-size:10.5pt;line-height:107%; font-family:宋体;Times New Roman";Times New Roman"">复制、数据类型(</span><span style="font-size:10.5pt;line-height:107%">Cassandra</span><span style="font-size:10.5pt;line-height:107%;font-family:宋体;Times New Roman";Times New Roman"">不支?/span><span style="font-size:10.5pt;line-height:107%">JSON</span><span style="font-size: 10.5pt;line-height:107%;font-family:宋体;Times New Roman";Times New Roman"">Q、主键、最l以一致性?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://neovintage.org/2016/04/07/data-modeling-in-cassandra-from-a-postgres-perspective/</span></a></p> <p> </p> <h1><span style="font-family: 'Comic Sans MS'; font-size: 18pt;">新闻</span></h1> <p style="line-height:107%"><span style="font-size: 10.5pt;line-height:107%">ESG</span><span style="font-size:10.5pt;line-height: 107%;font-family:宋体;Times New Roman";Times New Roman"">博客报导了最q?/span><span style="font-size:10.5pt;line-height:107%">Strata+Hadoop World</span><span style="font-size:10.5pt;line-height:107%;font-family:宋体;Times New Roman";Times New Roman"">大会的情cƈ有些重点xQ例?/span><span style="font-size:10.5pt;line-height:107%">Spark</span><span style="font-size:10.5pt;line-height:107%;font-family:宋体;Times New Roman";Times New Roman"">的良好势头、机器学习、云服务?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://blog.esg-global.com/riding-high-at-stratahadoop-world</span></a></p> <p> </p> <p><span style="font-size:10.5pt">InformationWeek</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">也报g</span><span style="font-size:10.5pt">Strata</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">大会Q关注了</span><span style="font-size:10.5pt">MapR</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Pivotal</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">的关灯片、h工智能等?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://www.informationweek.com/big-data/ai-public-data-sets-real-time-strata-+-hadoop-keynote-sampling/d/d-id/1324943?</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Spark Summit 2016</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">议程敲定Q将?/span><span style="font-size:10.5pt">6</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">6-8</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">日在旧金׃D行。会议将有两天展开五个方向的讨论?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">https://databricks.com/blog/2016/04/04/agenda-announced-for-sparksummit-2016-in-san-francisco.html</span></a></p> <p> </p> <p><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">布斯采访了</span><span style="font-size:10.5pt">Cloudera CEO Tom Reilly</span><span style="font-size: 10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q他讨论了公司的机遇、竞争性市场、上市计划等?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://www.forbes.com/sites/roberthof/2016/04/06/ceo-tom-reilly-makes-the-case-for-cloudera-and-its-ipo/</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Datanami</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">撰文正在崛L</span><span style="font-size:10.5pt">Apache Kafka</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">作ؓ式处理的支柱。文章还采访?/span><span style="font-size:10.5pt">Confluent</span><span style="font-size: 10.5pt;font-family:宋体;Times New Roman";Times New Roman"">联合创始人兼</span><span style="font-size:10.5pt">CTO Neha Narkhede</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q坊间她表示最q将推出</span><span style="font-size:10.5pt">Kafka Connect </span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt"> Kafka Streams</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://www.datanami.com/2016/04/06/real-time-rise-apache-kafka/</span></a></p> <p> </p> <p><span style="font-size:10.5pt">HBaseCon</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">于</span><span style="font-size:10.5pt">5</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">24</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">日在旧金山召开Q最q议E才正式宣布。在三个方向上,有</span><span style="font-size:10.5pt">20</span><span style="font-size:10.5pt;font-family: 宋体;Times New Roman";Times New Roman"">个以上的议题要讨论?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://blog.cloudera.com/blog/2016/04/hbasecon-2016-speaker-lineup-announced/</span></a></p> <p> </p> <h1><span style="font-family: 'Comic Sans MS'; font-size: 18pt;">发布</span></h1> <p> <span style="font-size:10.5pt">Apache HBase 0.98.18 </span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">1.1.4</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">最q都发布了?/span><span style="font-size:10.5pt">1.1.4</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">上有包括九个或正性在内的若干修复?/span><span style="font-size: 10.5pt">HBase 0.98.18</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">答{的仅解决了</span><span style="font-size:10.5pt">50</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">个问题(</span><span style="font-size:10.5pt">bug</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">、改善两个新Ҏ)?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://mail-archives.apache.org/mod_mbox/hbase-user/201603.mbox/%3CCANZa%3DGu-mAxKEtfoRjctHcE0KD7z52oE010Fgsf6AMmW2tDZLA%40mail.gmail.com%3E</span></a> <span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#333333"><br /> </span><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://mail-archives.apache.org/mod_mbox/hbase-user/201603.mbox/%3CCA%2BRK%3D_CtZ1L07nS6Og2ekfVwet0qTE7jw-bmyD2pp5UPweUehQ%40mail.gmail.com%3E</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Apache Lens</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">发布?/span><span style="font-size:10.5pt">2.5.0-beta</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q作为统一分析接口Q它已经支持</span><span style="font-size: 10.5pt">Hadoop</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">生态系l的执行引擎数据存储了。本ơ发布解决了</span><span style="font-size:10.5pt">87</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">,主要?/span><span style="font-size:10.5pt">bug</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">修复和实现新功能?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://mail-archives.us.apache.org/mod_mbox/www-announce/201604.mbox/%3CCAL3kmZj60kpopRPpOVEs9o7oTg7YuaC_=c8zncBeMyUESrZsmQ@mail.gmail.com%3E</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Airbnb </span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">开源了</span><span style="font-size:10.5pt"> Caravel</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q数据探索系l(数据可视化^収ͼ?/span><span style="font-size: 10.5pt">Caravel</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">支持多种在商业品上才能看到的特性,能够q接CQ意只要支?/span><span style="font-size:10.5pt">SQL</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">方言的系l。尤其它支持面向</span><span style="font-size:10.5pt">Druid</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">的实时分析?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">https://medium.com/airbnb-engineering/caravel-airbnb-s-data-exploration-platform-15a72aa610e5</span></a></p> <p> </p> <p><span style="font-size:10.5pt">MapR </span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">宣布支持</span><span style="font-size:10.5pt">Apache Drill 1.6</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">作ؓ他们的分布式pȝ。比较有亮点的发布有</span><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#333333;background:white">MapR-DB</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">新存储插件、新</span><span style="font-size:10.5pt">SQL</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">H口函数支持以及端对端安全。在|页介绍部分Q有些?/span><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#333333;background:white">MapR-DB API</span><span style="font-size:10.5pt;font-family:"MS Mincho";MS Mincho"; color:#333333;background:white">?/span><span style="font-size:10.5pt; font-family:SimSun;color:#333333;background:white">?/span><span style="font-size:10.5pt;font-family:"MS Mincho";MS Mincho"; color:#333333;background:white">数据q?/span><span style="font-size:10.5pt; font-family:SimSun;color:#333333;background:white">q?/span><span style="font-size:10.5pt">Drill</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">查询的例子?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">https://www.mapr.com/blog/apache-drill-16-mapr-converged-platform-gearing-new-generation-stack-json-enabled-big-data</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Apache Flink</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">发布了修?/span><span style="font-size:10.5pt">bug</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">后的</span><span style="font-size:10.5pt">1.0.x</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">。这ơ发布解决了</span><span style="font-size:10.5pt">23</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">个问题,推荐所?/span><span style="font-size:10.5pt">1.0.0</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">的用户升U?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://flink.apache.org/news/2016/04/06/release-1.0.1.html</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Cloudera Enterprise 5.7</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">发布附带?/span><span style="font-size:10.5pt">Spark</span><span style="font-size:10.5pt;font-family: 宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">HBase</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Impala</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Kafka</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">{组件版本的升。本ơ发布的亮点包括?/span><span style="font-size:10.5pt">Cloudera Labs </span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">新鲜推荐?/span><span style="font-size:10.5pt">Hive-on-Spark</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">HBase-Spark</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Impala</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">性能重要提升Q支?/span><span style="font-size:10.5pt">SSD </span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">HBase WAL</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://blog.cloudera.com/blog/2016/04/cloudera-enterprise-5-7-is-released/</span></a></p> <p> </p> <p><span style="font-size:10.5pt">Apache Tajo</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q构建在</span><span style="font-size:10.5pt">Hadoop</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">上的数据仓库pȝQ发布了</span><span style="font-size:10.5pt">0.11.2</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">版。新版本支持?/span><span style="font-size:10.5pt">Kerberos</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q修复了</span><span style="font-size:10.5pt">ORC</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">表对</span><span style="font-size:10.5pt">Hive</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">的支持等?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">http://tajo.apache.org/releases/0.11.2/announcement.html</span></a></p> <p> </p> <p><span style="font-size:10.5pt">LinkedIn </span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">开源了</span><span style="font-size:10.5pt"> Dr. Elephant</span><span style="font-size:10.5pt;font-family:宋体;Times New Roman";Times New Roman"">Q里面的工具能诊?/span><span style="font-size:10.5pt">Hadoop</span><span style="font-size:10.5pt;font-family: 宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">Spark</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">d的性能问题。基?/span><span style="font-size:10.5pt">metrics</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">?/span><span style="font-size:10.5pt">YARN</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">资源理器收集已完成d数据Q?/span><span style="font-size:10.5pt">Dr. Elephant</span><span style="font-size: 10.5pt;font-family:宋体;Times New Roman";Times New Roman"">评估后生成诊断报表,内容包括数据错位?/span><span style="font-size:10.5pt">GC</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">开销{?/span><span style="font-size:10.5pt">LinkedIn</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">宣称借助它能解决</span><span style="font-size:10.5pt">80%</span><span style="font-size:10.5pt; font-family:宋体;Times New Roman";Times New Roman"">的问题?/span></p> <p><a ><span style="font-size:10.5pt;font-family:"Helvetica Neue";Times New Roman";color:#0088CC;background:white">https://engineering.linkedin.com/blog/2016/04/dr-elephant-open-source-self-serve-performance-tuning-hadoop-spark</span></a></p> <p> </p> <h1><span style="font-family: 'Comic Sans MS'; font-size: 18pt;">zd</span></h1> <p><strong><span style="font-size:16.0pt;font-family:宋体;Times New Roman";Times New Roman"">中国</span></strong><strong></strong></p> <p><span style="font-family:宋体;Times New Roman";Times New Roman"">?/span></p><img src ="http://m.tkk7.com/rosen/aggbug/430099.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2016-04-14 18:02 <a href="http://m.tkk7.com/rosen/archive/2016/04/14/430099.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>开源面向对象数据库 db4o 之旅: 使用 dRS “db4o 之旅Q四Q?/title><link>http://m.tkk7.com/rosen/archive/2010/07/09/325618.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Fri, 09 Jul 2010 02:19:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2010/07/09/325618.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/325618.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2010/07/09/325618.html#Feedback</comments><slash:comments>9</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/325618.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/325618.html</trackback:ping><description><![CDATA[     摘要: 很多开发者对 hibernate 性能表示|疑Q下一ơ技术革C是什么呢Q——对象数据库 <br>q篇文章是开源面向对象数据库 db4o 之旅 pd文章的第 4 部分Q介l面向对象数据库 db4o ?db4o Replication System(dRS) —?db4o 复制pȝQƈ对其如何同步 Oracle 数据库进行分析?nbsp; <a href='http://m.tkk7.com/rosen/archive/2010/07/09/325618.html'>阅读全文</a><img src ="http://m.tkk7.com/rosen/aggbug/325618.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2010-07-09 10:19 <a href="http://m.tkk7.com/rosen/archive/2010/07/09/325618.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>使用SoftReference软引?/title><link>http://m.tkk7.com/rosen/archive/2010/06/22/324173.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Tue, 22 Jun 2010 07:27:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2010/06/22/324173.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/324173.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2010/06/22/324173.html#Feedback</comments><slash:comments>2</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/324173.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/324173.html</trackback:ping><description><![CDATA[     摘要: q是做个实际的SoftReference试吧?nbsp; <a href='http://m.tkk7.com/rosen/archive/2010/06/22/324173.html'>阅读全文</a><img src ="http://m.tkk7.com/rosen/aggbug/324173.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2010-06-22 15:27 <a href="http://m.tkk7.com/rosen/archive/2010/06/22/324173.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>使用Memory Analyzer tool(MAT)分析内存泄漏Q二Q?/title><link>http://m.tkk7.com/rosen/archive/2010/06/13/323522.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Sun, 13 Jun 2010 08:13:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2010/06/13/323522.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/323522.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2010/06/13/323522.html#Feedback</comments><slash:comments>19</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/323522.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/323522.html</trackback:ping><description><![CDATA[     摘要: 在^时工作过E中Q有时会遇到OutOfMemoryErrorQ我们知道遇到Error一般表明程序存在着严重问题Q可能是N性的。所以找出是什么原因造成OutOfMemoryError非常重要。现在向大家引荐Eclipse Memory Analyzer tool(MAT)Q来化解我们遇到的难题?nbsp; <a href='http://m.tkk7.com/rosen/archive/2010/06/13/323522.html'>阅读全文</a><img src ="http://m.tkk7.com/rosen/aggbug/323522.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2010-06-13 16:13 <a href="http://m.tkk7.com/rosen/archive/2010/06/13/323522.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item><item><title>使用Memory Analyzer tool(MAT)分析内存泄漏Q一Q?/title><link>http://m.tkk7.com/rosen/archive/2010/05/21/321575.html</link><dc:creator>Rosen</dc:creator><author>Rosen</author><pubDate>Fri, 21 May 2010 12:59:00 GMT</pubDate><guid>http://m.tkk7.com/rosen/archive/2010/05/21/321575.html</guid><wfw:comment>http://m.tkk7.com/rosen/comments/321575.html</wfw:comment><comments>http://m.tkk7.com/rosen/archive/2010/05/21/321575.html#Feedback</comments><slash:comments>23</slash:comments><wfw:commentRss>http://m.tkk7.com/rosen/comments/commentRss/321575.html</wfw:commentRss><trackback:ping>http://m.tkk7.com/rosen/services/trackbacks/321575.html</trackback:ping><description><![CDATA[     摘要: 在^时工作过E中Q有时会遇到OutOfMemoryErrorQ我们知道遇到Error一般表明程序存在着严重问题Q可能是N性的。所以找出是什么原因造成OutOfMemoryError非常重要。现在向大家引荐Eclipse Memory Analyzer tool(MAT)Q来化解我们遇到的难题?nbsp; <a href='http://m.tkk7.com/rosen/archive/2010/05/21/321575.html'>阅读全文</a><img src ="http://m.tkk7.com/rosen/aggbug/321575.html" width = "1" height = "1" /><br><br><div align=right><a style="text-decoration:none;" href="http://m.tkk7.com/rosen/" target="_blank">Rosen</a> 2010-05-21 20:59 <a href="http://m.tkk7.com/rosen/archive/2010/05/21/321575.html#Feedback" target="_blank" style="text-decoration:none;">发表评论</a></div>]]></description></item></channel></rss> <footer> <div class="friendship-link"> <p>лǵվܻԴȤ</p> <a href="http://m.tkk7.com/" title="亚洲av成人片在线观看">亚洲av成人片在线观看</a> <div class="friend-links"> </div> </div> </footer> վ֩ģ壺 <a href="http://jcss99.com" target="_blank">Ů˱ͰúˬƵ</a>| <a href="http://8hnbuk14.com" target="_blank">רר</a>| <a href="http://shcxsoft.com" target="_blank">91޵ҹ</a>| <a href="http://fennenll.com" target="_blank">޹þþþƷ</a>| <a href="http://zgdhuibao.com" target="_blank">޾ƷŮþþþ99</a>| <a href="http://bznys.com" target="_blank">ƷѾƷ</a>| <a href="http://xawsfkaisuo.com" target="_blank">ƷƬ߹ۿ</a>| <a href="http://www-kj5799.com" target="_blank">޾ƷŮþþ</a>| <a href="http://jsjumei.com" target="_blank">AVAV˵</a>| <a href="http://youweidianqi.com" target="_blank">޾Ʒ޿</a>| <a href="http://05942688.com" target="_blank">޳AVƬ߹ۿ</a>| <a href="http://eddiekidd.com" target="_blank">AVӰԺ߹ۿ</a>| <a href="http://xmm5pkt.com" target="_blank">91޹߲ҹ</a>| <a href="http://zhaoxinwo.com" target="_blank">պĻ</a>| <a href="http://zhuoyueyc.com" target="_blank"> պ Ļ</a>| <a href="http://lianghao999.com" target="_blank">ŮƵ</a>| <a href="http://trgod.com" target="_blank">ŷ޹SUV</a>| <a href="http://020iws.com" target="_blank">һaƵۿվ</a>| <a href="http://91haikala.com" target="_blank">Ʒ69XXXƵ</a>| <a href="http://789xxoo.com" target="_blank">˿wwwƵ</a>| <a href="http://35633487.com" target="_blank">þaѹۿ</a>| <a href="http://91haikala.com" target="_blank">߹ۿHַ</a>| <a href="http://147v.com" target="_blank">պѹۿһëƬ</a>| <a href="http://qzapp88.com" target="_blank">ȫԼƵ</a>| <a href="http://yy885.com" target="_blank">˳˳߹ۿ</a>| <a href="http://shcxsoft.com" target="_blank">ۺһ</a>| <a href="http://xsjxp.com" target="_blank">aƬ߹ۿ</a>| <a href="http://skcncar.com" target="_blank">߹ۿ</a>| <a href="http://4p5e.com" target="_blank">պƵ</a>| <a href="http://686kp.com" target="_blank">þWWW˳һƬ</a>| <a href="http://345504.com" target="_blank">þþƷվѹۿ</a>| <a href="http://aierphoto.com" target="_blank">Ѵѧ߹ۿp</a>| <a href="http://www998xe.com" target="_blank">޾ƷƬ߹ۿ </a>| <a href="http://caicpa.com" target="_blank">Ʒ99þѹۿ</a>| <a href="http://pumanpig.com" target="_blank">131ŮëƬ</a>| <a href="http://yg1617.com" target="_blank">ĻmvѸƵ7</a>| <a href="http://j2eesp.com" target="_blank">ŪƵ</a>| <a href="http://23usxx.com" target="_blank">޾Ʒav߹ۿ</a>| <a href="http://my637.com" target="_blank">ҹһӰԺ</a>| <a href="http://by6174.com" target="_blank">³³ƵѲ</a>| <a href="http://91xx8.com" target="_blank">Դ߹ۿѰ</a>| <script> (function(){ var bp = document.createElement('script'); var curProtocol = window.location.protocol.split(':')[0]; if (curProtocol === 'https') { bp.src = 'https://zz.bdstatic.com/linksubmit/push.js'; } else { bp.src = 'http://push.zhanzhang.baidu.com/push.js'; } var s = document.getElementsByTagName("script")[0]; s.parentNode.insertBefore(bp, s); })(); </script> </body>