Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ
Tải xuống
We live in the data age. It’s not easy to measure the total volume of data stored electronically, but an IDC estimate put the size of the “digital universe” at 0.18 zettabytes in 2006 and is forecasting a tenfold growth by 2011 to 1.8 zettabytes.1 A zettabyte is 1021 bytes, or equivalently one thousand exabytes, one million petabytes, or one billion terabytes. That’s roughly the same order of magnitude as one disk drive for every person in the world. | Storage and Analysis at Internet Scale The Definitive Guide O REILLY8 Tom White Programming Languages Iladoop Hadoop The Definitive Guide Ready to unlock the power of your data With this comprehensive guide you ll learn how to build and maintain reliable scalable distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size and for administrators who want to set up and run Hadoop clusters. You ll find illuminating case studies that demonstrate how Hadoop is used to solve specific problems. This third edition covers recent changes to Hadoop including material on the new MapReduce API as well as MapReduce 2 and its more flexible execution model YARN . Store large datasets with the Hadoop Distributed File System HDFS Run distributed computations with MapReduce Use Hadoop s data and I O building blocks for compression data integrity serialization including Avro and persistence Discover common pitfalls and advanced features for writing real-world MapReduce programs Design build and administer a dedicated Hadoop cluster or run Hadoop in the cloud Load data from relational databases into HDFS using Sqoop Perform large-scale data processing with the Pig query language Analyze datasets with Hive Hadoop s data warehousing system Take advantage of HBase for structured and semi-structured data and ZooKeeper for building distributed systems Strata Making Data Work Strata is the emerging ecosystem of people tools and technologies that turn big data into smart decisions. Find information and resources at oreilly.com data. No w you have the opportunity to learn about Hadoopfrom a master not only of the technology but also of common sense andplain talk. Doug Cutting Cloudem Tom White an engineer at Clotidera and member of the Apache Software Foundation has been an Apache Hadoop committer since February 2007. He has written numerous articles for oreilly.com java.net and IBM s developerWorks and speaks regularly about Hadoop at .