Users can now directly use flat files like CSV, JSON, Parquet, TSV, Avro and Sequence Files which are uploaded in AWS S3 instance and using the same they can create reports/dashboards inside Helical Insight. Following steps need to be followed.
Step 1: Download, install and start Apache Drill. To learn How to Install Drill.
Step 2: Enable S3 storage in apache drill by visiting http://IPAddress:8047/storage section.
For reference we have provided sample configuration of our AWS S3 instance.:
In S3 storage of apache drill, we have configured as below :
"type": "file", "enabled": true, "connection": "s3a://helicalinsight", "config": { "fs.s3a.access.key": "AKIAIJIZS7PFJOGR4EEA", "fs.s3a.secret.key": "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", "fs.s3a.endpoint": "s3.us-west-2.amazonaws.com" },
Step 3: Add below properties in core-site.xml file of apache drill which is located at …..\apache-drill-1.13.0\conf.
<configuration> <property> <name>fs.s3a.access.key</name> <value>AKIAIJIZS7PFJOGR4EEA</value> </property> <property> <name>fs.s3a.secret.key</name> <value>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</value> </property> <property> <name>fs.s3a.endpoint</name> <value>s3-us-west-2.amazonaws.com</value> </property> </configuration>
Note: Provide your AWS S3 details in the value tags. Above we have provided our details for just reference.
Step 4: After modifying the above XML file, restart service of apache drill.
Step 5: Now open the directory where Helical Insight application is installed.
Visit ..\Helical Insight\hi\hi-repository\System\Admin\DbConfig location and edit drill.efwd file.
In drill.efwd file look for else if("${schemas}".contains("dfs.")) and replace this line with below line : else if("${schemas}".contains("dfs.") || "${schemas}".contains("s3.")) // basically we have added configuration to ready info from drill s3 storage. Save this file and close it.
Step 6: Now login into Helical Insight application. Go to the Metadata tab. You will able to see s3.root schema inside drill connection. Expand this schema and you will able to see all your flat files located inside your AWS S3 instance.
Step 7: Create metadata by selecting flat files which you want to use for report creation. Save metadata
Step 8: By connecting to that metadata, you can start creating reports and dashboards.