v2.9.3
/ March 24, 2023; 13 months ago (2023-03-24)[1]
Repository
https://github.com/Alluxio/alluxio
Written in
Java
Operating system
macOS, Linux
Available in
Java
License
Apache License 2.0
Website
www.alluxio.io
Alluxio is an open-source virtual distributed file system (VDFS). Initially as research project "Tachyon", Alluxio was created at the University of California, Berkeley's AMPLab as Haoyuan Li's Ph.D. Thesis,[2] advised by Professor Scott Shenker & Professor Ion Stoica. Alluxio sits between computation and storage in the big data analytics stack. It provides a data abstraction layer for computation frameworks, enabling applications to connect to numerous storage systems through a common interface. The software is published under the Apache License.
Data Driven Applications, such as Data Analytics, Machine Learning, and AI, use APIs (such as Hadoop HDFS API, S3 API, FUSE API) provided by Alluxio to interact with data from various storage systems at a fast speed. Popular frameworks running on top of Alluxio include Apache Spark, Presto, TensorFlow, Trino, Apache Hive, and PyTorch, etc.
Alluxio can be deployed on-premise, in the cloud (e.g. Microsoft Azure, AWS, Google Compute Engine), or a hybrid cloud environment. It can run on bare-metal or in a containerized environments such as Kubernetes, Docker, Apache Mesos.
^
Li, Haoyuan (7 May 2018). Alluxio: A Virtual Distributed File System (Technical report). EECS Department, University of California, Berkeley. UCB/EECS-2018-29.
Alluxio is an open-source virtual distributed file system (VDFS). Initially as research project "Tachyon", Alluxio was created at the University of California...
section "Caching: Managing Data Replication in Alluxio". "Caching: Managing Data Replication in Alluxio". "Erasure Code Profiles". "Pools". Satyanarayanan...
data orchestration system, Alluxio. He is the Founder, Chairman, and CEO of Alluxio, Inc, a company commercializing the Alluxio Data Orchestration Technology...
distributed storage, Spark can interface with a wide variety, including Alluxio, Hadoop Distributed File System (HDFS), MapR File System (MapR-FS), Cassandra...
Hadoop project and runs on top of HDFS (Hadoop Distributed File System) or Alluxio, providing Bigtable-like capabilities for Hadoop. That is, it provides...
Alma mater Carnegie Mellon University Known for Chord Apache Spark Apache Mesos Alluxio Awards ACM Fellow SIGOPS Mark Weiser Award Scientific career Fields Cloud...
DCE/DFS, WekaFS, Lustre, PanFS, Google File System, Mnet, Chord Project. Alluxio BeeGFS (Fraunhofer) CephFS (Inktank, Red Hat, SUSE) Windows Distributed...
Attributions SECINFO_NO_NAME 9P (protocol) – Plan 9 Filesystem Protocol Alluxio Andrew File System BeeGFS, the parallel file system CacheFS – a caching...
provide a SQL interface and multi-dimensional analysis (OLAP) on Hadoop and Alluxio supporting extremely large datasets. It was originally developed by eBay...
many know it as the lab that invented Apache Mesos, and Apache Spark, and Alluxio. Berkeley launched RISELab as the successor to AMPLab in 2017. "AMPLab...
Hadoop's HDFS and compatible file systems such as Amazon S3 filesystem and Alluxio. It provides a SQL-like query language called HiveQL with schema on read...
system with billions of files and complete in a few hours.[citation needed] Alluxio ASM Cluster File System (ACFS) BeeGFS GFS2 Gluster Google File System List...
Name By License OS Description Alluxio UC Berkeley, Alluxio Apache License cross-platform An open-source virtual distributed file system (VDFS). BeeGFS...
temperature" or activity levels determines the primary storage hierarchy. Alluxio AMASS/DATAMGR from ADIC (Was available on SGI IRIX, Sun and HP-UX) IBM...
connection to a similar virtual database layer.[clarification needed] Alluxio, an open-source virtual distributed file system (VDFS), started at the...
input and output operators provide templates to sources and sinks such as Alluxio, S3, HDFS, NFS, FTP, Kafka, ActiveMQ, RabbitMQ, JMS, Cassandra, MongoDB...