About 65,400 results
Open links in new tab
  1. Apache Spark™ - Unified Engine for large-scale data analytics

    Apache Spark is a multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

  2. Documentation - Apache Spark

    Apache Spark™ Documentation Setup instructions, programming guides, and other documentation are available for each stable version of Spark below: Spark

  3. Downloads - Apache Spark

    Spark docker images are available from Dockerhub under the accounts of both The Apache Software Foundation and Official Images. Note that, these images contain non-ASF software …

  4. PySpark Overview — PySpark 4.0.1 documentation - Apache Spark

    Spark Connect is a client-server architecture within Apache Spark that enables remote connectivity to Spark clusters from any application. PySpark provides the client for the Spark …

  5. Overview - Spark 4.0.1 Documentation

    If you’d like to build Spark from source, visit Building Spark. Spark runs on both Windows and UNIX-like systems (e.g. Linux, Mac OS), and it should run on any platform that runs a …

  6. Migration Guide: Spark Core - Spark 4.0.0 Documentation

    Since Spark 4.0, spark.shuffle.service.db.backend is set to ROCKSDB by default which means Spark will use RocksDB store for shuffle service. To restore the behavior before Spark 4.0, …

  7. Structured Streaming Programming Guide - Spark 4.0.1 …

    In this model, Spark is responsible for updating the Result Table when there is new data, thus relieving the users from reasoning about it. As an example, let’s see how this model handles …

  8. CSV Files - Spark 3.5.5 Documentation

    Spark SQL provides spark.read().csv("file_name") to read a file or directory of files in CSV format into Spark DataFrame, and dataframe.write().csv("path") to write to a CSV file.

  9. News | Apache Spark

    Sep 24, 2025 · We’re proud to announce the release of Spark 0.7.0, a new major version of Spark that adds several key features, including a Python API for Spark and an alpha of Spark …

  10. Spark Release 3.5.5 - Apache Spark

    Dependency changes While being a maintenance release we did still upgrade some dependencies in this release they are: [SPARK-50886]: Upgrade Avro to 1.11.4 You can …