Strategy ONE
Import Data from Hadoop Gateway
You can import the following file types from a Hadoop Distributed File System (HDFS): .avro, .csv, .json, .orc, .parquet, .txt.
If you choose to import files that don't have extension, you will be prompted by the Confirm File Type dialog to identify the file type.
- Create a document or open an existing one.
- Choose Data > Add Dataset.
- Click Add External Data.
- Hover over Hadoop and click Browse Hadoop Files.
- Enter your connection credentials in the Data Source dialog and click Save.
- Drag your files from the left pane to the right pane.
- Click Finish.
- Click Connect Live or Import as an In-Memory Dataset.
The connectivity timeout between Hadoop Gateway and Intelligence Server is 20 minutes by default. To increase this timeout limit, create a file named QueryDSServerTimeout.ini
and place it in your Intelligence Server directory. The only entry in this file will be the numeric value (in minutes) for your timeout limit. Placing a value of -1 in this file will set the timeout to unlimited.