Programming Pig (Alan F Gates)

 

This manual is a fantastic resource for learning Apache Pig as well as a reference for running large data projects on Hadoop. Pig allows you to examine data without having to develop a complete application, making it simple for you to test out different data sets.

Pig Latin, Pig's data flow language, is useful for creating and managing data processing pipelines. Programming Pig demonstrates how to get started and teaches intermediate users the advantages of using Pig Latin. Advanced users learn how to construct complex data processing pipelines using Pig's macros and modularity features, as well as how to build systems for sophisticated data processing requirements by embedding Pig Latin in scripting languages.

  • Learn the advantages and disadvantages of using Pig instead of MapReduce
  • Understand how Pig fits in with other Hadoop components, such as HDFS, Hive, MapReduce, and HBase
  • Follow examples that explain built-in Pig Latin functions, and data operators such as join and group
  • Use grunt, the shell that Pig provides for exploring and working with HDFS
  • Get performance tuning tips for running Pig Latin scripts on Hadoop clusters in less time
  • Extend Pig with powerful user-defined functions written in Java or Python

Ebook Details

Author(s)
About the Authors
Alan Gates is a member of Yahoo's Pig development team, where he is responsible for the company's implementation of the language, including the design of its programming interfaces. He has discussed Pig at several conferences, user groups, universities, and businesses that use Pig. When Pig transitioned from a research project to a production project, Alan oversaw the nearly complete rewrite of the code base.
Published
October 22, 2011
Hardcover
224 pages
ISBN-10
1449302645
ISBN-13
978-1449302641

Similar Programming & Computer Books

Tcl/Tk Tutorial (Tutorials Point)
Tcl is a multi-paradigm, general-purpose system programming language. It is a scripting language designed to let applications communicate with one another. ...
Mastering Perl/Tk (Steve Lidie, et al)
Perl/Tk is a strong programming language that combines the Tk graphical toolkit with Perl, which is mostly used for system management, web development, and database processing. With Perl/Tk, you...
Tcl/Tk for Programmers: With Solved Exercises that Work with Unix and Windows (J. Adrian Zimmer)
This introduction to Tcl/Tk fills the gaps between introductory texts, thorough manuals, and collections of scripts that address specific issues. On both Windows and Unix platforms, there are...
Tcl Programming (Richard Suchenwirth)
This free programming book provides a thorough introduction to Tcl that begins with the fundamentals before broadening its focus to cover the more complex ideas, features, and programming idioms...
Dictionary of Programming Languages (Neal Ziring)
The Dictionary of Programming Languages is an online reference guide to computer coding techniques put together to educate and enhance your understanding of computer science history. Over 120 words...
A Beginner's Guide to Forth (J. V. Noble)
Programming languages of the past have nothing on the Forth programming language. It doesn't resemble C, Java, Pascal, Lisp, Python, etc. at all. ...
Thoughtful Programming and Forth (Jeff Fox)
This book was initially intended to be used in Forth programming classes, where students complete the exercises for each chapter on a computer. ...
Programming Forth (Stephen Pelc)
A full introduction to the Forth programming language may be found in Programming Forth. Written for readers with programming experience but perhaps not prior Forth exposure, it provides thorough...
What is Dart? (Kathy Walrath, et al)
This free brief booklet introduces the Google Dart language, libraries, and development resources that support the creation of structured, quick, and maintainable web applications that work in any current...
Learning Dart (Rip Tutorial)
Dart is a class-based, object-oriented language that makes it easier to create structured modern apps. It can be used to create simple scripts as well as huge applications, and...

Other Programming Books by O'Reilly Media

Mastering Perl/Tk (Steve Lidie, et al)
Perl/Tk is a strong programming language that combines the Tk graphical toolkit with Perl, which is mostly used for system management, web development, and database processing. With Perl/Tk, you...
Java Security (Scott Oaks)
Java Security by Scott Oaks is exceptional in both its technical breadth and readability. It offers a thorough introduction to the Java security architecture and security classes, as well...
O'Reilly® Java AWT Reference (John Zukowski)
The Java AWT Reference is a complete reference to the Abstract Window Toolkit (AWT), a sizable collection of classes for creating graphical user interfaces in Java. You can make windows,...
Free as in Freedom: Richard Stallman's Crusade for Free Software (Sam Williams)
Free as in Freedom profiles Richard Stallman, the man behind the GNU project, along with the political, social, and economic history of the free software movement. It...
Greasemonkey Hacks: Tips & Tools for Remixing the Web with Firefox (Mark Pilgrim)
For hardcore users who wish to learn Greasemonkey, the hottest new Firefox plugin that enables you to write scripts that modify the web pages you see, this book is...
Hacking Kubernetes: Threat-Driven Analysis and Defense (Andrew Martin, et al)
This useful open-source book provides a threat-based overview of Kubernetes security to help you operate your Kubernetes workloads in a secure and reliable manner. ...
What is Dart? (Kathy Walrath, et al)
This free brief booklet introduces the Google Dart language, libraries, and development resources that support the creation of structured, quick, and maintainable web applications that work in any current...
Natural Language Processing with Python – Analyzing Text with the Natural Language Toolkit (Steven Bird, et al)
The Natural Language Toolkit (NLTK) book is updated for Python 3 and NLTK 3 in this online edition from 2015.  
Cascading Style Sheets: The Definitive Guide (Eric A. Meyer)
The Web Design CD Bookshelf, Version 1.0, includes this book. To put it simply, CSS is a method for separating a document's structure from its presentation. The...
Developing on AWS with C#: A Comprehensive Guide on Using C# to Build Solutions on the AWS Platform (Noah Gift, et al)
This helpful book guides you through the process of transitioning your monolithic application to microservices on AWS.
Managing Projects with GNU Make: The Power of GNU make for Building Anything (Robert Mecklenburg)
One of the most enduring elements of both Unix and other operating systems is the utility known simply as make. Make, which was first developed in the 1970s, is...
Programming Embedded Systems in C and C++ (Michael Barr)
This free programming book's practical, no-nonsense approach will assist you in getting started by providing useful guidance from a person who has been in your position before and wants...
Planning for Big Data: A CIO's Handbook to the Changing Data Landscape (Edd Dumbill)
This free programming book offers a useful, approachable "brief" on the state of Big Data analytics today and how you may profitably use this technology to boost your company's...
Big Data Now: Current Perspectives from O'Reilly Radar (O'Reilly Radar Team)
This free programming book summarizes the report's findings on trends, techniques, applications, and predictions.  
Designing Event-Driven Systems: Concepts and Patterns for Streaming Services with Apache Kafka (Ben Stopford)
In Designing Event-Driven Systems, the author discusses how you can create mission-critical systems using service-based architectures and stream processing tools like Apache Kafka....
Visual Basic 2005: A Developer's Notebook (Matthew MacDonald)
The optimum test track is provided in this free programming book. This practical introduction to VB 2005 will get you up to speed on all the new features of...
Ajax Design Patterns (Michael Mahemoff)
You will learn best practices in this free programming book that will significantly enhance your web development initiatives. It looks at how others have resolved conflicts between design principles...
The Java Reference Library CD Bookshelf, 5 Bestselling Books (O'Reilly & Associates)
The Java Reference Library CD Bookshelf is a Java programmer's dream come true.
The Java Enterprise CD Bookshelf, 7 Bestselling Books on CD-ROM (O'Reilly & Associates)
Any Web browser can read The Java Enterprise CD Bookshelf because it is formatted in HTML. The books are fully cross-referenced and searchable. ...
Docker for Java Developers (Arun Gupta)
This free programming book demonstrates how to speed up the starting and deployment of your Java-based apps while introducing fundamental Docker concepts. You'll discover how Docker containers increase machine...
