Connecting To Flat Files Of S3 Storage Using Apache Drill

Users can now directly use flat files such as CSV, JSON, Parquet, TSV, Avro, and Sequence Files stored in an AWS S3 instance to create reports and dashboards inside Helical Insight. Follow the steps below.

Step 1: Download, install, and start Apache Drill. For installation instructions, see How to Install Drill.

Step 2: Enable the S3 storage in Apache Drill from the Storage section at http://IPAddress:8047/storage.

For reference, here is the sample S3 storage configuration from our Apache Drill instance (replace the placeholder keys with your own credentials):

  "type": "file",
  "enabled": true,
  "connection": "s3a://helicalinsight",
  "config": {
    "fs.s3a.access.key": "YOUR_AWS_ACCESS_KEY",
    "fs.s3a.secret.key": "YOUR_AWS_SECRET_KEY",
    "fs.s3a.endpoint": "s3.us-west-2.amazonaws.com"
  },
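Before saving the storage configuration, it can help to sanity-check it. The following is a minimal Python sketch, not part of Drill or Helical Insight: the function name check_s3_plugin is hypothetical, and the checks simply mirror the fields of the sample configuration above rather than Drill's own validation rules.

```python
import json

# Keys that appear in the "config" block of the sample above.
REQUIRED_CONFIG_KEYS = ("fs.s3a.access.key", "fs.s3a.secret.key", "fs.s3a.endpoint")


def check_s3_plugin(plugin):
    """Return a list of problems found in an S3 storage configuration dict."""
    problems = []
    if plugin.get("type") != "file":
        problems.append('"type" should be "file"')
    if plugin.get("enabled") is not True:
        problems.append('"enabled" should be true')
    if not str(plugin.get("connection", "")).startswith("s3a://"):
        problems.append('"connection" should start with "s3a://"')
    config = plugin.get("config") or {}
    for key in REQUIRED_CONFIG_KEYS:
        if not config.get(key):
            problems.append('missing "config" entry: ' + key)
    return problems


# The sample fragment, wrapped in braces so that it parses as JSON.
sample_text = """
{
  "type": "file",
  "enabled": true,
  "connection": "s3a://helicalinsight",
  "config": {
    "fs.s3a.access.key": "YOUR_AWS_ACCESS_KEY",
    "fs.s3a.secret.key": "YOUR_AWS_SECRET_KEY",
    "fs.s3a.endpoint": "s3.us-west-2.amazonaws.com"
  }
}
"""
print(check_s3_plugin(json.loads(sample_text)))
```

An empty list means that none of these basic checks failed. It does not confirm that the credentials or the bucket are valid.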

Step 3: Add the properties below to the core-site.xml file of Apache Drill, which is located at …..\apache-drill-1.13.0\conf.

<configuration>

    <property>
        <name>fs.s3a.access.key</name>
        <value>YOUR_AWS_ACCESS_KEY</value>
    </property>

    <property>
        <name>fs.s3a.secret.key</name>
        <value>YOUR_AWS_SECRET_KEY</value>
    </property>

    <property>
        <name>fs.s3a.endpoint</name>
        <value>s3-us-west-2.amazonaws.com</value>
    </property>

</configuration>

Note: Replace the values in the value tags with your own AWS S3 details. The values shown above are for reference only.
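To avoid typing credentials into the file by hand, the three properties can be generated from environment variables. This is a minimal Python sketch under stated assumptions: the function name build_core_site and the variable name DRILL_S3_ENDPOINT are hypothetical, and the two AWS variable names are only a common convention, not something Drill requires.

```python
import xml.etree.ElementTree as ET

# Property names come from the core-site.xml above; the environment
# variable names on the right are assumptions made for this sketch.
PROPERTY_ENV_VARS = {
    "fs.s3a.access.key": "AWS_ACCESS_KEY_ID",
    "fs.s3a.secret.key": "AWS_SECRET_ACCESS_KEY",
    "fs.s3a.endpoint": "DRILL_S3_ENDPOINT",
}


def build_core_site(env):
    """Build core-site.xml text from a mapping such as os.environ."""
    root = ET.Element("configuration")
    for name, env_var in PROPERTY_ENV_VARS.items():
        value = env.get(env_var)
        if not value:
            raise KeyError("missing environment variable: " + env_var)
        prop = ET.SubElement(root, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    if hasattr(ET, "indent"):  # pretty-printing is available from Python 3.9
        ET.indent(root, space="    ")
    return ET.tostring(root, encoding="unicode")


# Demonstration with placeholder values; pass os.environ in real use.
example_env = {
    "AWS_ACCESS_KEY_ID": "YOUR_AWS_ACCESS_KEY",
    "AWS_SECRET_ACCESS_KEY": "YOUR_AWS_SECRET_KEY",
    "DRILL_S3_ENDPOINT": "s3-us-west-2.amazonaws.com",
}
print(build_core_site(example_env))
```

The printed XML can be saved as core-site.xml in the Drill conf directory. If that file already contains other properties, merge the three property elements into it instead of overwriting it.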

Step 4: After modifying the XML file, restart the Apache Drill service.
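After the restart, one way to confirm that Drill can see the S3 storage is to run SHOW SCHEMAS through Drill's REST interface on port 8047. The sketch below is an illustration only: it assumes the storage is named s3 (as in the s3.root schema used later), that the web interface is reachable without authentication, and that the response holds a "rows" list keyed by SCHEMA_NAME. The function names are hypothetical.

```python
import json
import urllib.request


def build_query_request(base_url, sql):
    """Build a POST request for Drill's /query.json REST endpoint."""
    payload = json.dumps({"queryType": "SQL", "query": sql}).encode("utf-8")
    return urllib.request.Request(
        base_url.rstrip("/") + "/query.json",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def s3_schemas(response_body):
    """Pick the schema names starting with 's3.' out of a SHOW SCHEMAS result."""
    rows = response_body.get("rows", [])
    return [
        row["SCHEMA_NAME"]
        for row in rows
        if row.get("SCHEMA_NAME", "").startswith("s3.")
    ]


def list_s3_schemas(base_url):
    """Query a running Drill instance and return its S3 schemas."""
    request = build_query_request(base_url, "SHOW SCHEMAS")
    with urllib.request.urlopen(request, timeout=30) as response:
        return s3_schemas(json.load(response))
```

Calling list_s3_schemas("http://IPAddress:8047") against a running instance should include s3.root in the result if the storage is enabled. An empty list suggests that the storage is disabled or saved under a different name.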

Step 5: Open the directory where the Helical Insight application is installed.

Go to ..\Helical Insight\hi\hi-repository\System\Admin\DbConfig and edit the drill.efwd file.

In the drill.efwd file, look for this line:
else if("${schemas}".contains("dfs."))
and replace it with:
else if("${schemas}".contains("dfs.") || "${schemas}".contains("s3."))
This change lets Helical Insight read schema information from the Drill S3 storage. Save the file and close it.
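The effect of the changed condition can be illustrated with a small Python sketch. It only mirrors the string check from drill.efwd and is not Helical Insight code; both function names are hypothetical.

```python
def matches_original(schemas):
    """Mirror of the original check: only dfs schemas match."""
    return "dfs." in schemas


def matches_updated(schemas):
    """Mirror of the updated check: dfs and s3 schemas both match."""
    return "dfs." in schemas or "s3." in schemas


for name in ("dfs.tmp", "s3.root"):
    print(name, matches_original(name), matches_updated(name))
```

With the original check, a schema such as s3.root falls through to a later branch. With the updated check it takes the same branch as the dfs schemas.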

Step 6: Log in to the Helical Insight application and go to the Metadata tab. You will be able to see the s3.root schema inside the Drill connection. Expand this schema to see all the flat files located in your AWS S3 instance.

Step 7: Create metadata by selecting the flat files you want to use for report creation, then save the metadata.

Step 8: By connecting to that metadata, you can start creating reports and dashboards.
