# Assumes an existing SparkSession named `spark` (e.g. a PySpark shell or Jupyter notebook).
jdbc_url = "jdbc:postgresql://localhost:5432/postgres"
properties = {
    "user": "postgres",
    "password": "postgrespwd",
    "driver": "org.postgresql.Driver"
}

table_name = "player_seasons"

df = spark.read.jdbc(url=jdbc_url, table=table_name, properties=properties)

df.show()

When I ran the above code, I got the following error:

Py4JJavaError: An error occurred while calling o28.jdbc. : java.lang.ClassNotFoundException: org.postgresql.Driver

 

You need the PostgreSQL JDBC JAR.


Step 1: Download the JDBC driver

Go to the PostgreSQL site, or download it directly:

wget https://jdbc.postgresql.org/download/postgresql-42.7.2.jar

You can also use any recent 42.x.x version.
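
If you want to confirm the downloaded file really contains the driver class (this assumes a JDK's jar tool is on your PATH; unzip -l works too):

jar tf postgresql-42.7.2.jar | grep Driver.class

You should see org/postgresql/Driver.class in the listing.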


Step 2: Make the JAR available to Spark

There are three common ways:

Option 1: Copy to Spark jars/ directory

cp postgresql-42.7.2.jar $SPARK_HOME/jars/

Spark automatically loads all JARs in that folder.
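
To double-check that the JAR landed where Spark will look (assuming $SPARK_HOME is set in your shell and points at the installation your notebook uses):

ls $SPARK_HOME/jars/ | grep postgresql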


Option 2: Pass the JAR when creating SparkSession

from pyspark.sql import SparkSession

spark = (SparkSession.builder
    .appName("PostgreSQL Test")
    .config("spark.jars", "/home/frank/postgresql-42.7.2.jar")
    .getOrCreate()
)

Now your spark.read.jdbc(...) will find the driver.
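
If you run your code with spark-submit instead of building the session in a notebook, the equivalent is the --jars flag (your_script.py below is a placeholder for your own application file):

spark-submit --jars /home/frank/postgresql-42.7.2.jar your_script.py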


Option 3: Use Maven coordinates (auto-download)

spark = (SparkSession.builder
    .appName("PostgreSQL Test")
    .config("spark.jars.packages", "org.postgresql:postgresql:42.7.2")
    .getOrCreate()
)

Spark will automatically download the JAR from Maven.
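
The downloaded artifact is cached locally (by default under ~/.ivy2/), so later startups reuse it. The spark-submit equivalent is --packages (again with a placeholder script name):

spark-submit --packages org.postgresql:postgresql:42.7.2 your_script.py

Note that spark.jars.packages only takes effect when the session (and its JVM) is first created; setting it on an already-running session does nothing.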


Step 3: Retry reading the table

jdbc_url = "jdbc:postgresql://localhost:5432/postgres"
properties = {
    "user": "postgres",
    "password": "postgrespwd",
    "driver": "org.postgresql.Driver"
}

table_name = "player_seasons"

df = spark.read.jdbc(url=jdbc_url, table=table_name, properties=properties)

df.show()

Now it should work. ✅


⚡ Note

    • Make sure your PostgreSQL instance is reachable from WSL.
      If PostgreSQL is running on Windows, use the Windows host's IP address as seen from WSL, e.g., jdbc:postgresql://<Windows-IP>:5432/postgres
      Inside WSL, localhost points to the WSL environment itself, not to Windows. One way to find the Windows host's IP is sketched right below.
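
On WSL2 with the default NAT networking, the Windows host is usually the default gateway seen from inside WSL (an assumption about your setup; verify it matches), so you can look it up with:

ip route show default | awk '{print $3}'

You can then check that PostgreSQL actually accepts connections on that address, e.g. nc -zv <Windows-IP> 5432. PostgreSQL on Windows must also listen on that interface (listen_addresses in postgresql.conf) and allow the WSL subnet in pg_hba.conf.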

 

jdbc_url = "jdbc:postgresql://172.26.64.1:5432/postgres"
properties = {
    "user": "postgres",
    "password": "postgrespwd",
    "driver": "org.postgresql.Driver"
}

table_name = "player_seasons"

df = spark.read.jdbc(url=jdbc_url, table=table_name, properties=properties)

df.show()

 

I chose Option 1. Do I need to restart Spark after copying the JAR file? I have changed 'localhost' to the Windows IP, but the code still gives the same error.
 
ChatGPT said:

Yes — you must restart Spark after copying the JAR into $SPARK_HOME/jars/.

Here’s why:

  • When you start Spark, it loads all JARs in $SPARK_HOME/jars/ into the JVM classpath.

  • If you copy a new JAR after Spark is already running, the running JVM cannot see it.

  • This is exactly why your code still shows:

java.lang.ClassNotFoundException: org.postgresql.Driver

✅ Correct steps after copying the JAR

  1. Stop any running Spark master or workers.

  2. Start the Spark master and worker again.
     (Standalone-mode commands for steps 1–2 are sketched after this list.)

  3. Restart your Jupyter kernel (or Python process), so a fresh JVM is launched with the new classpath.

  4. Then run your JDBC code again.
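
For a standalone Spark setup, steps 1 and 2 typically use the stock scripts under $SPARK_HOME/sbin (a sketch, assuming Spark 3.1+ with a single local worker and the default master port; older versions use the *-slave.sh script names):

$SPARK_HOME/sbin/stop-worker.sh
$SPARK_HOME/sbin/stop-master.sh
$SPARK_HOME/sbin/start-master.sh
$SPARK_HOME/sbin/start-worker.sh spark://localhost:7077

If your session runs with master local[*], there are no daemons to restart; restarting the Jupyter kernel in step 3 is enough, since that launches a fresh JVM that rescans $SPARK_HOME/jars/.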

 
[Stage 0:>                                                          (0 + 1) / 1]
+-----------+---+------+------+-------+-------+----------+-----------+------------+---+---+---+---+------+--------+--------+-------+------+-------+------+
|player_name|age|height|weight|college|country|draft_year|draft_round|draft_number| gp|pts|reb|ast|netrtg|oreb_pct|dreb_pct|usg_pct|ts_pct|ast_pct|season|
+-----------+---+------+------+-------+-------+----------+-----------+------------+---+---+---+---+------+--------+--------+-------+------+-------+------+
+-----------+---+------+------+-------+-------+----------+-----------+------------+---+---+---+---+------+--------+--------+-------+------+-------+------+
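
No more ClassNotFoundException: the connection works and Spark prints the table's schema. The result set itself is empty, which just means the player_seasons table returned no rows, not that anything is wrong with the driver setup.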

 
