HP Vertica Hadoop Distributed File System (HDFS) Connector

HP Vertica was the first analytic database company to deliver a Hadoop Connector. HP Vertica now offers two connectors to transfer data seamlessly between Hadoop and HP Vertica:
  1. The Hadoop Distributed File System (HDFS) connector enables you to load data from HDFS using the HP Vertica native COPY facility. This mechanism simplifies and accelerates the process of loading data stored in HDFS without any MapReduce coding. The connector also ensures that data is loaded from the Hadoop cluster with the optimal amount of parallelism. By using the connector with the HP Vertica External Tables feature, you can even query data in HDFS without copying data into HP Vertica.
  2. The Hadoop & Pig Connector is bidirectional and enables you to move data from Hadoop to HP Vertica or vice versa via either MapReduce or Pig jobs.
With HP Vertica HDFS and Pig Connectors, you have unprecedented flexibility and speed in loading data from HDFS to the HP Vertica Analytics Platform and querying data from the HP Vertica Analytics Platform in Hadoop. The HP Vertica HDFS and Pig Connectors are open source, supported by HP Vertica, and available for download.
HP Vertica provides optimized JDBC and ODBC client drivers for most platforms including Windows, Linux, Solaris, AIX, and others.

Leave a Reply

Your email address will not be published. Required fields are marked *