ZhangZhihui's Blog  

2025年11月26日

摘要: newsgroups = newsgroups.drop('tf_features') # dropping non-existed column doesn't cause an error 阅读全文
posted @ 2025-11-26 15:15 ZhangZhihuiAAA 阅读(5) 评论(0) 推荐(0)
 
摘要: from pyspark.sql.functions import expr df_filtered = df_filtered.withColumn('filtered_array', expr('filter(filtered_doc, x -> len(x) >= 4)')) Please h 阅读全文
posted @ 2025-11-26 11:03 ZhangZhihuiAAA 阅读(2) 评论(0) 推荐(0)
 
摘要: In PostgreSQL, you can convert a timestamp to a date (i.e., drop hours/minutes/seconds) in several common ways: ✅ 1. Cast to date Fastest and simplest 阅读全文
posted @ 2025-11-26 08:29 ZhangZhihuiAAA 阅读(15) 评论(0) 推荐(0)