[Chen Huailin: This young man is outstanding, not only because he read through these classic papers, but because he also took the trouble to translate them. His blog is 銀河里的星星 ("Stars in the Galaxy"). He can be reached at phylips@bmy.]

Translator's preface to the distributed systems papers
A chronicle of SQL and NoSQL
Storage, computation, and querying of massive data

I. Google paper series
1. Translator's preface to the Google paper series
2. The Anatomy of a Large-Scale Hypertextual Web Search Engine
3. Web Search for a Planet: The Google Cluster Architecture
4. GFS: The Google File System
5. MapReduce: Simplified Data Processing on Large Clusters
6. Bigtable: A Distributed Storage System for Structured Data
7. Chubby: The Chubby Lock Service for Loosely-Coupled Distributed Systems
8. Sawzall: Interpreting the Data: Parallel Analysis with Sawzall
9. Pregel: A System for Large-Scale Graph Processing
10. Dremel: Interactive Analysis of Web-Scale Datasets
11. Percolator: Large-scale Incremental Processing Using Distributed Transactions and Notifications
12. Megastore: Providing Scalable, Highly Available Storage for Interactive Services
13. Case Study GFS: Evolution on Fast-forward
14. Google File System II: Dawn of the Multiplying Master Nodes
Google paper series translations (collected edition)

II. Distributed theory series
00. Appraising Two Decades of Distributed Computing Theory Research
0. How to Build a Highly Available System Using Consensus
1. Translator's preface to the distributed theory series
2. A Brief History of Consensus, 2PC and Transaction Commit
3. The Byzantine Generals Problem - Leslie Lamport
4. Impossibility of Distributed Consensus with One Faulty Process
5. Leases: the lease mechanism
6. Paxos Made Simple
7. 2PC (two-phase commit): Notes on Data Base Operating Systems - Jim Gray
8. 3PC (three-phase commit): Nonblocking Commit Protocols
9. Life beyond Distributed Transactions: an Apostate's Opinion
10. A Comparison of the Byzantine Agreement Problem and the Transaction Commit Problem - Jim Gray
11. Uniform Consensus Is Harder than Consensus
12. Consensus on Transaction Commit - Jim Gray & Leslie Lamport
13. Paxos Made Live - An Engineering Perspective
14. Fast Paxos - Leslie Lamport
15. The Part-Time Parliament - Leslie Lamport
16. Time, Clocks, and the Ordering of Events in a Distributed System - Leslie Lamport
17. Distributed Snapshots: Determining Global States of a Distributed System - Leslie Lamport
18. Virtual Time and Global States of Distributed Systems
19. Timestamps in Message-Passing Systems That Preserve the Partial Ordering
20. Fundamentals of Distributed Computing: A Practical Tour of Vector Clock Systems

III. SQL and NoSQL theory series
0. Towards Robust Distributed Systems: Brewer's 2000 PODC keynote
1. The CAP theorem
2. Harvest, Yield, and Scalable Tolerant Systems
3. Brewer's Conjecture and the Feasibility of Consistent, Available, Partition-Tolerant Web Services
4. The BASE model: BASE: An ACID Alternative
5. Eventual consistency
6. Scalability design patterns
7. Scalability principles
8. MapReduce: A Major Step Backwards
9. The Five-Minute Rule 20 Years Later (and How Flash Memory Changes the Rules)
10. The Transaction Concept: Virtues and Limitations - Jim Gray
11. The NoSQL ecosystem
12. scalability-availability-stability-patterns

IV. Basic algorithms and data structures
1. Summary of methods for processing large-volume, massive data
2. Summary of methods for processing large-volume, massive data (continued)
3. Consistent Hashing and Random Trees
4. Merkle Trees
5. Scalable Bloom Filters
6. Introduction to Distributed Hash Tables
7. B-Trees and Relational Database Systems
8. The Log-Structured Merge-Tree
9. Lock-free data structures
10. Data Structures for Spatial Database
11. Gossip
12. Lock-free algorithms
13. The Graph Traversal Pattern

V. Basic systems and practical experience
1. Berkeley DB white paper
2. Dynamo: Amazon's Highly Available Key-value Store
3. Cassandra - A Decentralized Structured Storage System
4. PNUTS: Yahoo!'s Hosted Data Serving Platform
5. Hive - A Petabyte Scale Data Warehouse Using Hadoop
6. Benchmarking Cloud Serving Systems with YCSB
7. Designs, Lessons and Advice from Building Large Distributed Systems - Jeff Dean
8. Challenges in Building Large-Scale Information Retrieval Systems - Jeff Dean
9. Apache Hadoop Goes Realtime at Facebook
10. A survey of Hadoop platform optimization

VI. Other supporting systems
1. The Ganglia Distributed Monitoring System: Design, Implementation, and Experience
2. Chukwa: A Large-Scale Monitoring System

VII. Miscellaneous
Reflections on Trusting Trust - Ken Thompson
Who Needs an Architect?
Go To Statement Considered Harmful - Edsger W. Dijkstra
No Silver Bullet: Essence and Accidents of Software Engineering - Frederick P. Brooks