Apache Spark information

Apache Spark
Original author(s)	Matei Zaharia
Developer(s)	Apache Spark
Initial release	May 26, 2014; 9 years ago
Stable release	3.5.0 (Scala 2.13) / September 9, 2023; 7 months ago
Repository	Spark Repository
Written in	Scala
Operating system	Microsoft Windows, macOS, Linux
Available in	Scala, Java, SQL, Python, R, C#, F#
Type	Data analytics, machine learning algorithms
License	Apache License 2.0
Website	spark.apache.org

Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming clusters with implicit data parallelism and fault tolerance. Originally developed at the University of California, Berkeley's AMPLab, the Spark codebase was later donated to the Apache Software Foundation, which has maintained it since.

^ "Spark Release 2.0.0". MLlib in R: SparkR now offers MLlib APIs [..] Python: PySpark now offers many more MLlib algorithms"

[1] "Spark Release 2.0.0". MLlib in R: SparkR now offers MLlib APIs [..] Python: PySpark now offers many more MLlib algorithms"

Apache Spark information

and 29 Related for: Apache Spark information

Apache Spark

Apache Kafka

Apache ZooKeeper

Graph Query Language

Ali Ghodsi

Matei Zaharia

Apache Parquet

List of Apache Software Foundation projects

Reynold Xin

Databricks

Apache Pig

Apache ORC

Apache Mahout

Holden Karau

Apache Avro

Apache Hadoop

Apache Arrow

Apache POI

Ion Stoica

Apache Beam

Spark

XGBoost

AMPLab

Apache SystemDS

MapR

Apache Samza

Hierarchical Data Format

Bzip2

Hortonworks