Oak: A Scalable Off-Heap Allocated Key-Value Map (PPoPP 2020 - Main Conference)

Who

Hagar Meir, Edward Bortnikov, Anastasia Braginsky, Dmitry Basin, Yonatan Gottesman, Eshcar Hillel, Idit Keidar, Eran Meir, Gali Sheffi

Track

PPoPP 2020 Main Conference

Time Zone

The program is currently displayed in (GMT-08:00) Tijuana, Baja California.

Use conference time zone: (GMT-08:00) Tijuana, Baja CaliforniaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Mon 24 Feb 2020 10:00 - 10:25 - Key Value Store (Mediterranean Ballroom) Chair(s): Milind Chabbi

Abstract

Efficient ordered in-memory key-value (KV-)maps are paramount for the scalability of modern data platforms. In managed languages like Java, KV-maps face unique challenges due to the high overhead of garbage collection (GC).

We present Oak, a scalable concurrent KV-map for environments with managed memory. Oak offloads data from the managed heap, thereby reducing GC overheads and improving memory utilization. An important consideration in this context is the programming model since a standard object-based API entails moving data between the on- and off-heap spaces. In order to avoid the cost associated with such movement, Oak introduces a novel zero-copy (ZC) API alongside the traditional one (e.g., Java’s ConcurrentNavigableMap). Oak allows concurrency among all map operations, offering atomic get, put, and various conditional put operations such as compute (in-situ update) and put-if-absent.

We have released an open-source Java implementation of Oak. We further present a prototype Oak-based implementation of the internal multidimensional index in Apache Druid – a popular open-source in-memory real-time analytics system. Our experiments show that Oak can be in many cases 2x faster than Java’s state-of-the-art concurrent skiplist.

Hagar Meir

IBM Haifa Research Lab

Edward Bortnikov

Yahoo Research

Anastasia Braginsky

Yahoo Research

Dmitry Basin

Yahoo Research

Yonatan Gottesman

Yahoo Research

Eshcar Hillel

Yahoo Research, Oath

Idit Keidar

Technion - Israel institute of technology

Israel

Eran Meir

Yahoo Research

Gali Sheffi

Technion - Israel

Time Zone

The program is currently displayed in (GMT-08:00) Tijuana, Baja California.

Use conference time zone: (GMT-08:00) Tijuana, Baja CaliforniaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Mon 24 Feb
Displayed time zone: Tijuana, Baja California change

09:35 - 10:25	Key Value Store (Mediterranean Ballroom)Main Conference Chair(s): Milind Chabbi Uber Technologies Inc.

09:35 25m Talk		Kite: Efficient and Available Release Consistency for the Datacenter Main Conference Vasilis Gavrielatos University of Edinburgh, UK, Antonios Katsarakis University of Edinburgh, UK, Vijay Nagarajan University of Edinburgh, UK, Boris Grot University of Edinburgh, UK, Arpit Joshi Intel
10:00 25m Talk		Oak: A Scalable Off-Heap Allocated Key-Value Map Main Conference Hagar Meir IBM Haifa Research Lab, Edward Bortnikov Yahoo Research, Anastasia Braginsky Yahoo Research, Dmitry Basin Yahoo Research, Yonatan Gottesman Yahoo Research, Eshcar Hillel Yahoo Research, Oath, Idit Keidar Technion - Israel institute of technology, Eran Meir Yahoo Research, Gali Sheffi Technion - Israel