This document discusses Alluxio, an open-source memory-centric distributed storage system that serves data from memory, SSDs, and HDDs to analytics and AI/ML frameworks. It provides a unified namespace and supports various storage systems. The architecture includes a master node for metadata and workflow management and worker nodes. Resiliency is enabled through an active-passive master configuration and lineage-based recovery of lost data. The system has been used in production with over 100 nodes managing 1+ PB of data and providing 30x performance improvements.