Accessing External Data with PXF
Data managed by your organization may already reside in external sources such as Hadoop, object stores, and other SQL databases. The LightDB-A Platform Extension Framework (PXF) provides access to this external data via built-in connectors that map an external data source to a LightDB-A Database table definition.
PXF is installed with Hadoop and Object Storage connectors. These connectors enable you to read external data stored in text, Avro, JSON, RCFile, Parquet, SequenceFile, and ORC formats. You can use the JDBC connector to access an external SQL database.
Note In previous versions of LightDB-A Database, you may have used the
gphdfs
external table protocol to access data stored in Hadoop. LightDB-A Database version 6.0.0 removes thegphdfs
protocol. Use PXF and thepxf
external table protocol to access Hadoop in LightDB-A Database version 6.x.
The LightDB-A Platform Extension Framework includes a C-language extension and a Java service. After you configure and initialize PXF, you start a single PXF JVM process on each LightDB-A Database segment host. This long- running process concurrently serves multiple query requests.
For detailed information about the architecture of and using PXF, refer to the LightDB-A Platform Extension Framework (PXF) documentation.
Parent topic: Working with External Data
Parent topic: Loading and Unloading Data