-
Notifications
You must be signed in to change notification settings - Fork 28.8k
Pull requests: apache/spark
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[SPARK-53572][SQL] Avoid throwing from ExtractValue.isExtractable
SQL
#52333
opened Sep 13, 2025 by
vladimirg-db
Loading…
[SPARK-43579][PYTHON] optim: Cache the converter between Arrow and pandas for reuse
PYTHON
SQL
#52332
opened Sep 13, 2025 by
petern48
Loading…
[SPARK-53361][SS][1/2] Optimizing JVM–Python Communication in TWS by Grouping Multiple Keys into One Arrow Batch
CORE
PYTHON
SQL
STRUCTURED STREAMING
#52331
opened Sep 12, 2025 by
zeruibao
Loading…
[SPARK-] Fix AnalysisContext being wiped during nested plan resolution
SQL
#52330
opened Sep 12, 2025 by
andyl-db
Loading…
[SPARK-53547] [Python] Add make_timestamp_ntz overload with date/time parameters
CONNECT
PYTHON
SQL
#52329
opened Sep 12, 2025 by
Yicong-Huang
Loading…
[SPARK-53560][SS][SQL] Crash looping when retrying uncommitted batch in Kafka source and AvailableNow trigger
SQL
STRUCTURED STREAMING
#52327
opened Sep 12, 2025 by
eason-yuchen-liu
Loading…
[SPARK-53560][SS][SQL] Crash looping when retrying uncommitted batch in Kafka source and AvailableNow trigger
SQL
STRUCTURED STREAMING
#52326
opened Sep 12, 2025 by
eason-yuchen-liu
Loading…
[SPARK-53568][CONNECT][PYTHON] Fix several small bugs in Spark Connect Python client error handling logic
CONNECT
PYTHON
SQL
#52325
opened Sep 12, 2025 by
khakhlyuk
Loading…
[SPARK-53524][CONNECT][SQL][4.0] Fix temporal value conversion in LiteralValueProtoConverter
CONNECT
SQL
#52324
opened Sep 12, 2025 by
heyihong
Loading…
[WIP][PYTHON] Make
@udf
support vectorized UDF
BUILD
CONNECT
PYTHON
SQL
#52323
opened Sep 12, 2025 by
zhengruifeng
•
Draft
[SPARK-53563][PS] Optimize: sql_processor by avoiding inefficient string concatenation
PANDAS API ON SPARK
PYTHON
#52322
opened Sep 12, 2025 by
petern48
Loading…
[SPARK-53323][CONNECT] Support df.asTable() for Arrow UDTF in Spark Connect
CONNECT
PYTHON
SQL
#52320
opened Sep 12, 2025 by
shujingyang-db
Loading…
[SPARK-53387][PYTHON] Add support for Arrow UDTFs with PARTITION BY
CORE
PYTHON
SQL
#52317
opened Sep 11, 2025 by
allisonwang-db
Loading…
[SPARK-53559][SQL][CATALYST] Fix HLL sketch updates to use raw collation key bytes
SQL
#52316
opened Sep 11, 2025 by
cboumalh
Loading…
[SPARK-53558][SQL] Show fully qualified table name including the catalog name in the exception message when the table is not found
SQL
#52315
opened Sep 11, 2025 by
ganeshashree
Loading…
[SPARK-53556][CONNECT] Avoid setting redundant struct data types in LiteralValueProtoConverter
CONNECT
SQL
#52312
opened Sep 11, 2025 by
heyihong
Loading…
Fix: SparkML-connect can't load SparkML (legacy mode) saved model
ML
PYTHON
#52311
opened Sep 11, 2025 by
WeichenXu123
Loading…
[SPARK-53553][CONNECT] Fix handling of null values in LiteralValueProtoConverter
CONNECT
SQL
#52310
opened Sep 11, 2025 by
heyihong
Loading…
[DOCS] Fix comma placement in array_append doc
SQL
#52309
opened Sep 11, 2025 by
rodrigoccurvo
Loading…
[SPARK-53551][SQL] Improve
OffsetAndLimit
by avoiding duplicate evaluation
SQL
#52307
opened Sep 11, 2025 by
beliefer
Loading…
[SPARK-53562][PYTHON] Limit Arrow batch sizes in
applyInArrow
and applyInPandas
CONNECT
CORE
PYTHON
SQL
#52303
opened Sep 11, 2025 by
zhengruifeng
•
Draft
[SPARK-53546][SQL][TESTS] Fix InMemoryDataSource to return default value or null for new fields
SQL
#52299
opened Sep 10, 2025 by
szehon-ho
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.