Moving Hadoop to the Cloud: Harnessing Cloud Features and Flexibility for Hadoop Clusters

"O'Reilly Media, Inc."
Free sample

Until recently, Hadoop deployments existed on hardware owned and run by organizations. Now, of course, you can acquire the computing resources and network connectivity to run Hadoop clusters in the cloud. But there’s a lot more to deploying Hadoop to the public cloud than simply renting machines.

This hands-on guide shows developers and systems administrators familiar with Hadoop how to install, use, and manage cloud-born clusters efficiently. You’ll learn how to architect clusters that work with cloud-provider features—not just to avoid pitfalls, but also to take full advantage of these services. You’ll also compare the Amazon, Google, and Microsoft clouds, and learn how to set up clusters in each of them.

  • Learn how Hadoop clusters run in the cloud, the problems they can help you solve, and their potential drawbacks
  • Examine the common concepts of cloud providers, including compute capabilities, networking and security, and storage
  • Build a functional Hadoop cluster on cloud infrastructure, and learn what the major providers require
  • Explore use cases for high availability, relational data with Hive, and complex analytics with Spark
  • Get patterns and practices for running cloud clusters, from designing for price and security to dealing with maintenance
Read more
Collapse

About the author

Bill Havanki is a software engineer working for Cloudera, where he has contributed to Hadoop components as well as systems for deploying Hadoop clusters into public Cloud services. Prior to joining Cloudera he worked for 15 years developing software for government contracts, focusing mostly on analytic frameworks and authentication and authorization systems. He earned his B.S. in Electrical Engineering from Rutgers University and his M.S. in Computer Engineering from North Carolina State University. A New Jersey native, he currently lives near Annapolis, Maryland with his family.

Read more
Collapse
Loading...

Additional Information

Publisher
"O'Reilly Media, Inc."
Read more
Collapse
Published on
Jul 14, 2017
Read more
Collapse
Pages
338
Read more
Collapse
ISBN
9781491959589
Read more
Collapse
Read more
Collapse
Read more
Collapse
Language
English
Read more
Collapse
Genres
Computers / Data Processing
Computers / Databases / Servers
Computers / Enterprise Applications / General
Computers / Information Technology
Computers / Management Information Systems
Computers / Systems Architecture / Distributed Systems & Computing
Read more
Collapse
Content Protection
This content is DRM free.
Read more
Collapse
Read Aloud
Available on Android devices
Read more
Collapse
Eligible for Family Library

Reading information

Smartphones and Tablets

Install the Google Play Books app for Android and iPad/iPhone. It syncs automatically with your account and allows you to read online or offline wherever you are.

Laptops and Computers

You can read books purchased on Google Play using your computer's web browser.

eReaders and other devices

To read on e-ink devices like the Sony eReader or Barnes & Noble Nook, you'll need to download a file and transfer it to your device. Please follow the detailed Help center instructions to transfer the files to supported eReaders.
Despite the buzz surrounding the cloud computing, only a small percentage of organizations have actually deployed this new style of IT—so far. If you're planning your long-term cloud strategy, this practical book provides insider knowledge and actionable real-world lessons regarding planning, design, operations, security, and application transformation. This book teaches business and technology managers how to transition their organization's traditional IT to cloud computing. Rather than yet another book trying to sell or convince readers on the benefits of clouds, this book provides guidance, lessons learned, and best practices on how to design, deploy, operate, and secure an enterprise cloud based on real-world experience.



Author James Bond provides useful guidance and best-practice checklists based on his field experience with real customers and cloud providers. You'll view cloud services from the perspective of a consumer and as an owner/operator of an enterprise private or hybrid cloud, and learn valuable lessons from successful and less-than-successful organization use-case scenarios. This is the information every CIO needs in order to make the business and technical decisions to finally execute on their journey to cloud computing.

Get updated trends and definitions in cloud computing, deployment models, and for building or buying cloud servicesDiscover challenges in cloud operations and management not foreseen by early adoptersUse real-world lessons to plan and build an enterprise private or hybrid cloudLearn how to assess, port, and migrate legacy applications to the cloudIdentify security threats and vulnerabilities unique to the cloudEmploy a cloud management system for your enterprise (private or multi-provider hybrid) cloud ecosystemUnderstand the challenges for becoming an IT service broker leveraging the power of the cloud
The professional's one-stop guide to this open-source, Java-based big data framework

Professional Hadoop is the complete reference and resource for experienced developers looking to employ Apache Hadoop in real-world settings. Written by an expert team of certified Hadoop developers, committers, and Summit speakers, this book details every key aspect of Hadoop technology to enable optimal processing of large data sets. Designed expressly for the professional developer, this book skips over the basics of database development to get you acquainted with the framework's processes and capabilities right away. The discussion covers each key Hadoop component individually, culminating in a sample application that brings all of the pieces together to illustrate the cooperation and interplay that make Hadoop a major big data solution. Coverage includes everything from storage and security to computing and user experience, with expert guidance on integrating other software and more.

Hadoop is quickly reaching significant market usage, and more and more developers are being called upon to develop big data solutions using the Hadoop framework. This book covers the process from beginning to end, providing a crash course for professionals needing to learn and apply Hadoop quickly.

Configure storage, UE, and in-memory computing Integrate Hadoop with other programs including Kafka and Storm Master the fundamentals of Apache Big Top and Ignite Build robust data security with expert tips and advice

Hadoop's popularity is largely due to its accessibility. Open-source and written in Java, the framework offers almost no barrier to entry for experienced database developers already familiar with the skills and requirements real-world programming entails. Professional Hadoop gives you the practical information and framework-specific skills you need quickly.

Bring the best out of DevOps and build, deploy, and maintain applications on AWSAbout This BookWork through practical examples and gain DevOps best practices to successfully deploy applications on AWSSuccessfully provision and operate distributed application systems and your AWS infrastructure using DevOpsPerform Continuous Integration and deployment and fine-tune the way you deliver on AWSWho This Book Is For

This book is for system administrators and developers who manage AWS infrastructure and environments and are planning to implement DevOps in their organizations. Those aiming for the AWS Certified DevOps Engineer certification will also find this book useful. Prior experience of operating and managing AWS environments is expected.

What You Will LearnDesign and deploy infrastructure as code within your AWS Virtual Private CloudImplement Continuous Integration using AWS ServicesConfigure EC2 instances using SaltStackImplement Continuous Deployment using Jenkins and the AWS CLICollect important metrics and log data to gain more insight into infrastructure and applicationsTroubleshooting popular issues with some less known techniques using the AWS platformIn Detail

Knowing how to adopt DevOps in your organization is becoming an increasingly important skill for developers, whether you work for a start-up, an SMB, or an enterprise.

This book will help you to drastically reduce the amount of time spent on development and increase the reliability of your software deployments on AWS using popular DevOps methods of automation.

To start, you will get familiar with the concept of IaC and will learn to design, deploy, and maintain AWS infrastructure. Further on, you'll see how to design and deploy a Continuous Integration platform on AWS using either open source or AWS provided tools/services.

Following on from the delivery part of the process, you will learn how to deploy a newly created, tested, and verified artefact to the AWS infrastructure without manual intervention. You will then find out what to consider in order to make the implementation of Configuration Management easier and more effective.

Toward the end of the book, you will learn some tricks and tips to optimize and secure your AWS environment. By the end of the book, you will have mastered the art of implementing DevOps practices onto AWS.

Style and approach

This book is packed full of real-world examples demonstrating use cases that help you deploy DevOps best practices on AWS.

©2019 GoogleSite Terms of ServicePrivacyDevelopersArtistsAbout Google|Location: United StatesLanguage: English (United States)
By purchasing this item, you are transacting with Google Payments and agreeing to the Google Payments Terms of Service and Privacy Notice.