Skip to content

Commit

Permalink
Merge pull request #1 from dyllamt/feature/update-readme
Browse files Browse the repository at this point in the history
use self-hosted runner
  • Loading branch information
dyllamt authored Dec 5, 2023
2 parents 2b5b180 + 80d32da commit af9ed87
Show file tree
Hide file tree
Showing 5 changed files with 10 additions and 2 deletions.
3 changes: 3 additions & 0 deletions .github/workflows/pull-request.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -46,6 +46,9 @@ jobs:
- name: K8s Tests - Run Jobs
run: |
kind load docker-image kafka-producer --name github-cluster
kind load docker-image spark-jobs --name github-cluster
kind load docker-image trino-queries --name github-cluster
cd scripts
./04-run-spark-jobs.sh
./05-run-trino-query.sh
3 changes: 2 additions & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,6 +8,7 @@ Apache data system written for development in a local cluster and incramental de
- `Kafka` event source to ingest realtime application data
- `Spark` framework for microbatch and batch processes
- `Delta` ACID-compliant storage layer on file storage
- `Hive` metadata store for the delta schemas
- `Trino` analytics query engine for ad-hoc analysis

## System Tests
Expand All @@ -19,7 +20,7 @@ An end-to-end test of the system can be run in kubernetes. The test:
3. performs windowed aggregations on the data and saves the results
4. triggers a sql analytics query through trino to simulate an analyst

The tests are triggered through github actions.
The tests are triggered through github actions, although you will need to use a self-hosted runner.


## Developer Notes
Expand Down
3 changes: 3 additions & 0 deletions scripts/04-run-spark-jobs.sh
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,8 @@ while : ; do
exit 1
fi
if [[ $elapsed -ge $TIMEOUT ]]; then
kubectl describe sparkapplication $APP_NAME -n $NAMESPACE
kubectl logs pyspark-ingest-driver -n dev
echo "Timed out waiting for the application to start."
exit 1
fi
Expand Down Expand Up @@ -51,6 +53,7 @@ while : ; do
exit 1
fi
if [[ $elapsed -ge $TIMEOUT ]]; then
kubectl describe sparkapplication $APP_NAME -n $NAMESPACE
echo "Timed out waiting for the application to finish."
exit 1
fi
Expand Down
3 changes: 2 additions & 1 deletion scripts/05-run-trino-query.sh
Original file line number Diff line number Diff line change
@@ -1 +1,2 @@
kubectl apply -f ../k8s/trino/trino-query.yaml -n dev
kubectl apply -f ../k8s/trino/trino-query.yaml -n dev
kubectl wait --for=condition=complete --timeout=60s job/trino-query-job -n dev
Binary file modified system-design.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit af9ed87

Please sign in to comment.