skip to main content
FAST '24: Proceedings of the 22nd USENIX Conference on File and Storage Technologies
2024 Proceeding
  • Others:
  • Xiaosong Ma,
  • Youjip Won
Publisher:
  • USENIX Association
  • 2560 Ninth St. Suite 215 Berkeley, CA
  • United States
Conference:
FAST'24: 22nd USENIX Conference on File and Storage Technologies Santa Clara CA USA February 27 - 29, 2024
ISBN:
978-1-939133-38-0
Published:
11 March 2024
Sponsors:
FUTUREWEI, NSF, Google Inc., IBM, Samsung

Reflects downloads up to 10 Nov 2024Bibliometrics
Abstract

No abstract available.

Skip Table Of Content Section
SECTION: Distributed storage
research-article
TeRM: extending RDMA-attached memory with SSD
Article No.: 1, Pages 1–16

RDMA-based in-memory storage systems offer high performance but are restricted by the capacity of physical memory. In this paper, we propose TeRM to extend RDMA-attached memory with SSD. TeRM achieves fast remote access on the SSD-extended memory by ...

research-article
Combining buffered I/O and direct I/O in distributed file systems
Article No.: 2, Pages 17–34

Direct I/O allows I/O requests to bypass the Linux page cache and was introduced over 20 years ago as an alternative to the default buffered I/O mode. However, high-performance computing (HPC) applications still mostly rely on buffered I/O, even if ...

research-article
OmniCache: collaborative caching for near-storage accelerators
Article No.: 3, Pages 35–50

We propose OmniCache, a novel caching design for near-storage accelerators that combines near-storage and host memory capabilities to accelerate I/O and data processing. First, OmniCache introduces a "near-cache" approach, maximizing data access to the ...

SECTION: Caching
research-article
Symbiosis: the art of application and kernel cache cooperation
Article No.: 4, Pages 51–70

We introduce Symbiosis, a framework for key-value storage systems that dynamically configures application and kernel cache sizes to improve performance. We integrate Symbiosis into three production systems - LevelDB, WiredTiger, and RocksDB - and, ...

research-article
Optimizing file systems on heterogeneous memory by integrating DRAM cache with virtual memory management
Article No.: 5, Pages 71–88

This paper revisits the usage of DRAM cache in DRAM-PM heterogeneous memory file systems. With a comprehensive analysis of existing file systems with cache-based and DAX-based designs, we show that both suffer from suboptimal performance due to excessive ...

research-article
Kosmo: efficient online miss ratio curve generation for eviction policy evaluation
Article No.: 6, Pages 89–106

In-memory caches play an important role in reducing the load on backend storage servers for many workloads. Miss ratio curves (MRCs) are an important tool for configuring these caches with respect to cache size and eviction policy. MRCs provide insight ...

SECTION: File Systems
research-article
I/O Passthru: upstreaming a flexible and efficient I/O path in Linux
Article No.: 7, Pages 107–122

New storage interfaces continue to emerge fast on Non-Volatile Memory Express (NVMe) storage. Fitting these innovations in the general-purpose I/O stack of operating systems has been challenging and time-consuming. The NVMe standard is no longer limited ...

research-article
Metis: file system model checking via versatile input and state exploration
Article No.: 8, Pages 123–140

We present Metis, a model-checking framework designed for versatile, thorough, yet configurable file system testing in the form of input and state exploration. It uses a nondeterministic loop and a weighting scheme to decide which system calls and their ...

research-article
RFUSE: modernizing userspace filesystem framework through scalable kernel-userspace communication
Article No.: 9, Pages 141–158

With the advancement of storage devices and the increasing scale of data, filesystem design has transformed in response to this progress. However, implementing new features within an in-kernel filesystem is a challenging task due to development ...

SECTION: Flash Storage
research-article
The design and implementation of a capacity-variant storage system
Article No.: 10, Pages 159–176

We present the design and implementation of a capacity-variant storage system (CVSS) for flash-based solid-state drives (SSDs). CVSS aims to maintain high performance throughout the lifetime of an SSD by allowing storage capacity to gracefully reduce ...

research-article
I/O in a flash: evolution of ONTAP to low-latency SSDs
Article No.: 11, Pages 177–192

Flash-based persistent storage media are capable of sub-millisecond latency I/O. However, a storage architecture optimized for spinning drives may contain software delays that make it impractical for use with such media. The NetApp® ONTAP® storage system ...

research-article
We ain't afraid of no file fragmentation: causes and prevention of its performance impact on modern flash SSDs
Article No.: 12, Pages 193–208

A few studies reported that fragmentation still adversely affects the performance of flash solid-state disks (SSDs) particularly through request splitting. This research investigates the fragmentation-induced performance degradation across three levels: ...

SECTION: Key-Value Systems
research-article
In-memory key-value store live migration with NetMigrate
Article No.: 13, Pages 209–224

Distributed key-value stores today require frequent key-value shard migration between nodes to react to dynamic workload changes for load balancing, data locality, and service elasticity. In this paper, we propose NetMigrate, a live migration approach ...

research-article
IONIA: high-performance replication for modern disk-based KV stores
Article No.: 14, Pages 225–242

We introduce IONIA, a novel replication protocol tailored for modern SSD-based write-optimized key-value (WO-KV) stores. Unlike existing replication approaches, IONIA carefully exploits the unique characteristics of SSD-based WO-KV stores. First, it ...

research-article
Physical vs. logical indexing with IDEA: inverted deduplication-aware index
Article No.: 15, Pages 243–258

In the realm of information retrieval, the need to maintain reliable term-indexing has grown more acute in recent years, with vast amounts of ever-growing online data searched by a large number of search-engine users and used for data mining and natural ...

research-article
MIDAS: minimizing write amplification in log-structured systems through adaptive group number and size configuration
Article No.: 16, Pages 259–276

Log-structured systems are widely used in various applications because of its high write throughput. However, high garbage collection (GC) cost is widely regarded as the primary obstacle for its wider adoption. There have been numerous attempts to ...

SECTION: Cloud Storage
research-article
What's the story in EBS glory: evolutions and lessons in building cloud block store
Article No.: 17, Pages 277–292

In this paper, we qualitatively and quantitatively discuss the design choices, production experience, and lessons in building the Elastic Block Storage (EBS) at ALIBABA CLOUD over the past decade. To cope with hardware advancement and users' demands, we ...

research-article
ELECT: enabling erasure coding tiering for LSM-tree-based storage
Article No.: 18, Pages 293–310

Given the skewed nature of practical key-value (KV) storage workloads, distributed KV stores can adopt a tiered approach to support fast data access in a hot tier and persistent storage in a cold tier. To provide data availability guarantees for the hot ...

research-article
MinFlow: high-performance and cost-efficient data passing for I/O-intensive stateful serverless analytics
Article No.: 19, Pages 311–328

Serverless computing has revolutionized application deployment, obviating traditional infrastructure management and dynamically allocating resources on demand. A significant use case is I/O-intensive applications like data analytics, which widely employ ...

SECTION: AI and Storage
research-article
COLE: a column-based learned storage for blockchain systems
Article No.: 20, Pages 329–346

Blockchain systems suffer from high storage costs as every node needs to store and maintain the entire blockchain data. After investigating Ethereum's storage, we find that the storage cost mostly comes from the index, i.e., Merkle Patricia Trie (MPT). ...

research-article
Baleen: ML admission & prefetching for flash caches
Article No.: 21, Pages 347–372

Flash caches are used to reduce peak backend load for throughput-constrained data center services, reducing the total number of backend servers required. Bulk storage systems are a large-scale example, backed by high-capacity but low-throughput hard ...

research-article
Seraph: towards scalable and efficient fully-external graph computation via on-demand processing
Article No.: 22, Pages 373–387

Fully-external graph computation systems exhibit optimal scalability by computing the ever-growing, large-scale graph with constant amount of memory on a single machine. In particular, they keep the entire massive graph data in storage and iteratively ...

Contributors
Index terms have been assigned to the content through auto-classification.

Recommendations