Tutorials

These tutorials guide you through running Spark jobs on HPC clusters using sparkctl. Each tutorial covers a different workflow - choose the one that best fits your needs.

Which Tutorial Should I Use?

Tutorial

Best For

Client Install

Interface

spark-submit / pyspark

Traditional Spark users, production jobs

sparkctl[pyspark] (full)

CLI

Spark Connect CLI

Lightweight client, remote connectivity

sparkctl

CLI

Python Library

Programmatic control, automation scripts

sparkctl

Python

Interactive Development

Exploratory analysis, debugging

sparkctl

Python REPL

Decision Guide

Start here if you’re new to sparkctl: spark-submit / pyspark - this is the most familiar workflow for existing Spark users.

Choose by use case: