Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
-
Updated
Nov 27, 2024 - Java
Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.
Scriptis is for interactive data analysis with script development(SQL, Pyspark, HiveQL), task submission(Spark, Hive), UDF, function, resource management and intelligent diagnosis.
Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.
Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.
Supercharge Your Compute for Analytics & AI
Twitter data analysis using hadoop (hdfs), flume, map-reduce and hive. Sentiment Analysis is also done using affin dictionary for tweets related to Indian election.
Banking Data Analysis Using SQL ,SQOOP, HIVE, HADOOP, TABLEAU, R, UNIX
Hive Query Language example with Apache Hive, Apache Hadoop, Java
Apache Hive Query Language example
Spark Bulk Data Load. This is a data engineering project. Ingest the data from Hive tables present in the Master Data Management Platform and do the all processing in spark.
Add a description, image, and links to the hive-table topic page so that developers can more easily learn about it.
To associate your repository with the hive-table topic, visit your repo's landing page and select "manage topics."