This book has step-by-step instructions to solve data manipulation problems using PDI in the form of recipes. It has plenty of well-organized tips, screenshots, tables, and examples to aid quick and easy understanding. If you are a software developer or anyone involved or interested in developing ET
Pentaho Data Integration Cookbook
✍ Scribed by Alex Meadows
- Publisher
- Packt Publishing
- Year
- 2013
- Tongue
- English
- Leaves
- 462
- Category
- Library
No coin nor oath required. For personal study only.
✦ Synopsis
Pentaho Data Integration Cookbook Second Edition is written in a cookbook format, presenting examples in the style of recipes.This allows you to go directly to your topic of interest, or follow topics throughout a chapter to gain a thorough in-depth knowledge.Pentaho Data Integration Cookbook Second Edition is designed for developers who are familiar with the basics of Kettle but who wish to move up to the next Read more...
✦ Table of Contents
Content: Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Working with Databases; Introduction; Connecting to a database; Getting data from a database; Getting data from a database by providing parameters; Getting data from a database by running a query built at runtime; Inserting or updating rows in a table; Inserting new rows where a simple primary key has to be generated; Inserting new rows where the primary key has to be generated based on stored values; Deleting data from a table Creating or altering a database table from PDI (design time)Creating or altering a database table from PDI (runtime); Inserting, deleting, or updating a table depending on a field; Changing the database connection at runtime; Loading a parent-child table; Building SQL queries via database metadata; Performing repetitive database design tasks from PDI; Chapter 2: Reading and Writing Files; Introduction; Reading a simple file; Reading several files at the same time; Reading semi-structured files; Reading files having one field per row; Reading files with some fields occupying two or more rows Writing a simple fileWriting a semi-structured file; Providing the name of a file (for reading or writing) dynamically; Using the name of a file (or part of it) as a field; Reading an Excel file; Getting the value of specific cells in an Excel file; Writing an Excel file with several sheets; Writing an Excel file with a dynamic number of sheets; Reading data from an AWS S3 Instance; Chapter 3: Working with Big Data and Cloud Sources; Introduction; Loading data into Salesforce.com; Getting data from Salesforce.com; Loading data into Hadoop; Getting data from Hadoop; Loading data into HBase Getting data from HBaseLoading data into MongoDB; Getting data from MongoDB; Chapter 4: Manipulating XML Structures; Introduction; Reading simple XML files; Specifying fields by using Path notation; Validating well-formed XML files; Validating an XML file against DTD definitions; Validating an XML file against an XSD schema; Generating a simple XML document; Generating complex XML structures; Generating an HTML page using XML and XSL transformations; Reading an RSS Feed; Generating an RSS Feed; Chapter 5: File Management; Introduction; Copying or moving one or more files Deleting one or more filesGetting files from a remote server; Putting files on a remote server; Copying or moving a custom list of files; Deleting a custom list of files; Comparing files and folders; Working with ZIP files; Encrypting and decrypting files; Chapter 6: Looking for Data; Introduction; Looking for values in a database table; Looking for values in a database with complex conditions; Looking for values in a database with dynamic queries; Looking for values in a variety of sources; Looking for values by proximity; Looking for values by using a web service
Abstract: Pentaho Data Integration Cookbook Second Edition is written in a cookbook format, presenting examples in the style of recipes.This allows you to go directly to your topic of interest, or follow topics throughout a chapter to gain a thorough in-depth knowledge.Pentaho Data Integration Cookbook Second Edition is designed for developers who are familiar with the basics of Kettle but who wish to move up to the next level.It is also aimed at advanced users that want to learn how to use the new features of PDI as well as and best practices for working with Kettle
📜 SIMILAR VOLUMES
Pentaho Data Integration 4 Cookbook; Pentaho Data Integration 4 Cookbook; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Support files, eBooks, discount offers and more; Why Subscribe?; Free Access for Packt account holders; Preface; What this book covers; What you need for this
<p>Over 100 recipes for building open source ETL solutions with Pentaho Data Integration</p> <p><b>Overview</b></p> <ul> <li>Intergrate Kettle in integration with other components of the Pentaho Business Intelligence Suite, to build and publish Mondrian schemas,create reports, and populatedashboards
<p>Over 100 recipes for building open source ETL solutions with Pentaho Data Integration</p> <p><b>Overview</b></p> <ul> <li>Intergrate Kettle in integration with other components of the Pentaho Business Intelligence Suite, to build and publish Mondrian schemas,create reports, and populatedashboards
<p>Over 100 recipes for building open source ETL solutions with Pentaho Data Integration</p> <p><b>Overview</b></p> <ul> <li>Intergrate Kettle in integration with other components of the Pentaho Business Intelligence Suite, to build and publish Mondrian schemas,create reports, and populatedashboards