摘要:
from pyspark.sql import SparkSession spark = SparkSession.builder.appName("HDFS Read Example").getOrCreate() # Define the HDFS path hdfs_path = "hdfs:
阅读全文
posted @ 2025-01-26 22:33
ZhangZhihuiAAA
阅读(16)
推荐(0)
posted @ 2025-01-26 21:42
ZhangZhihuiAAA
阅读(8)
推荐(0)
摘要:
Singular Value Decomposition (SVD) is a powerful mathematical technique used in linear algebra to factorize a matrix into three simpler matrices. It i
阅读全文
posted @ 2025-01-26 16:37
ZhangZhihuiAAA
阅读(43)
推荐(0)
摘要:
1. Download: https://spark.apache.org/downloads.html 2. Install: (base) zzh@ZZHPC:~/Downloads/sfw$ tar -xvzf spark-3.5.4-bin-hadoop3.tgz 3. Set enviro
阅读全文
posted @ 2025-01-26 12:30
ZhangZhihuiAAA
阅读(20)
推荐(0)
摘要:
1. First, we will set up the imports that are required for the dashboard view: from __future__ import annotations from typing import TYPE_CHECKING fro
阅读全文
posted @ 2025-01-20 13:16
ZhangZhihuiAAA
阅读(22)
推荐(0)
摘要:
from airflow.decorators import ( dag, task, ) from pendulum import datetime @dag( schedule="@none", start_date=datetime(2025, 1, 1), catchup=False, de
阅读全文
posted @ 2025-01-19 23:18
ZhangZhihuiAAA
阅读(167)
推荐(0)
摘要:
To create a new connection, select the + to add a new record. from airflow.decorators import ( dag, task, ) from pendulum import datetime @dag( start_
阅读全文
posted @ 2025-01-19 22:38
ZhangZhihuiAAA
阅读(5)
推荐(0)
摘要:
To retrieve these images, I frequently make use of the NASA Astronomy Picture of the Day API (https://apod.nasa.gov/apod/astropix.html) to gather a ne
阅读全文
posted @ 2025-01-19 15:35
ZhangZhihuiAAA
阅读(28)
推荐(0)
摘要:
(.venv) frank@ZZHUBT:~/venvs/my_airflow_project$ airflow config get-value core executor SequentialExecutor
阅读全文
posted @ 2025-01-19 13:46
ZhangZhihuiAAA
阅读(8)
推荐(0)
摘要:
rank@ZZHUBT:~$ pip install airflowctl rank@ZZHUBT:~$ airflowctl init my_airflow_project --build-start ...... ebserver | [2025-01-18 20:46:08 +0800] [1
阅读全文
posted @ 2025-01-18 21:51
ZhangZhihuiAAA
阅读(23)
推荐(0)