4 - 5 December 2024 | Geneva, Switzerland

The Sched app allows you to build your schedule but is separate from your event registration. You must be registered for Cephalocon 2024 to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

This schedule is displayed in Central European Time (CET).
Wednesday, December 4
 

11:00 CET

Beyond Particle Physics: The Impact of Ceph and OpenStack on CERN's Multi-Datacenter Cloud Strategy - Enrico Bocchi, CERN, European Organization for Nuclear Research & Jose Castro Leon, CERN
Wednesday December 4, 2024 11:00 - 11:35 CET
CERN IT operates a large-scale storage and computing infrastructure at the service of scientific research and its user community: Ceph provides block, object, and file storage at a scale of 100 PBs, while OpenStack provisions bare-metal nodes, VMs, and virtual networking managing more than 450k CPUs. With the advent of a new computing center a few kilometers away from the main campus, compute and storage resources have been re-imagined to extend the capabilities offered by the infrastructure, while making clear upfront design choices that favor availability and ease of operations. In this presentation we report on how the new computing center was designed to cohesively host compute and storage resources, how integration with the existing computing center was achieved, and which new capabilities have been unlocked thanks to the newly built DC. For Ceph in particular, we share insights on achieving data locality with compute resources, deploying a multi-site object storage service, and running a CephFS service that spans both data centers.
Speakers
Jose Castro Leon

Cloud Technical Leader, CERN, European Organization for Nuclear Research
Jose is the Technical Leader for the CERN Cloud Infrastructure Service. He holds an MSc in Computer Science from Universidad de Oviedo. He joined CERN in 2010 and has since worked first in virtualisation and then as part of the cloud team that built CERN's OpenStack-based... Read More →
Enrica Porcari

Head Of Information Technology Department, CERN
Enrica Maria Porcari is Head of the IT Department at CERN. Previously Enrica was the UN World Food Programme’s Chief Information Officer and Director of Technology and Chair of the UN’s Emergency Telecommunications Cluster. In this role Enrica drove WFP to be the leading edge of... Read More →
Wednesday December 4, 2024 11:00 - 11:35 CET
SG Auditorium C
  Session Presentation
  • Audience Level Any

11:00 CET

Ceph NVMe-oF Road Map - Orit Wasserman & Mike Burkhart, IBM
Wednesday December 4, 2024 11:00 - 11:35 CET
Discover the current status and our future plans for the Ceph NVMe-oF gateway. This session will cover the latest developments, highlighting new features focusing on security and performance enhancements. Join us to see what’s next for Ceph NVMe-oF!
Speakers
Mike Burkhart

Technical Product Manager - IBM Storage Ceph, NVMe/TCP and VMware Integration, IBM
Mike is a 25-year veteran of the IT data center space, spanning software development and testing, data center and hybrid cloud architecture, and now product management. Currently he collaborates with the brilliant engineers who develop Ceph to bring new features to open source... Read More →
Orit Wasserman

Distinguished Engineer, IBM
Orit is a Distinguished Engineer at IBM, specializing in Software Defined Storage (Ceph) and storage for containerized apps (OpenShift Data Foundation) as well as hybrid/multi-cloud. With a strong background as a software engineer and architect, Orit's passion lies in open-source... Read More →
Wednesday December 4, 2024 11:00 - 11:35 CET
SG Auditorium B
  Session Presentation
  • Audience Level Any

11:00 CET

Enhancing Observability and Monitoring for Large Ceph Clusters at Scale - Filipp Akinfiev, Clyso GmbH
Wednesday December 4, 2024 11:00 - 11:35 CET
Maintaining performance and reliability in large Ceph clusters, especially with Rados Gateway (RadosGW), is challenging. Traditional observability approaches often generate excessive data without providing actionable insights. This talk introduces an advanced observability architecture that combines basic monitoring with on-demand detailed and event-triggered monitoring, ensuring continuous visibility and dynamic responsiveness. We'll explore:
- The four-layer architecture, detailing producers, the NATS messaging backbone, and consumers.
- Monitoring techniques, including basic, on-demand, and event-triggered monitoring.
- A case study, sharing proof-of-concept insights and lessons learned.
- Future directions, discussing potential advancements.
This presentation is designed for cloud infrastructure engineers, SREs, and DevOps professionals looking to implement a scalable observability framework that improves system health and performance.
Speakers
Filipp Akinfiev

Senior System Architect, Clyso GmbH
With over 30 years of experience in software development and system administration, I possess a broad technical knowledge in software and system architecture, as well as integration architectures. I specialize in designing and implementing forward-thinking, high-performance software... Read More →
Wednesday December 4, 2024 11:00 - 11:35 CET
SG Auditorium A
  Session Presentation
  • Audience Level Any

11:40 CET

Keeping Ceph RGW Object Storage Consistent - Jane Zhu, Bloomberg
Wednesday December 4, 2024 11:40 - 12:15 CET
Data powers Bloomberg’s financial products. Ceph clusters are the backbone of Bloomberg’s internal S3 cloud storage systems, which host this data and serve billions of requests a day. During intensive usage of the Ceph RGW object storage with multi-site settings, we encountered different types of data inconsistencies, such as bucket-index and RADOS object inconsistency, unfinished transactions, and multi-site replication inconsistency. These inconsistencies may be caused by software bugs, race conditions, system timeouts, and other reasons. Since we cannot guarantee the system is always bug-free and operating smoothly, it’s crucial that we can identify an inconsistency, should it happen, and fix or report it. While there are existing tools and code in place to help address some of these issues, their usage has limitations. As such, we are proposing a scalable and extensible bucket scrubbing approach to systematically check for, identify, and where possible fix any inconsistency in the RGW object storage system at the bucket level. This talk will discuss the design of this bucket scrubbing system and a prototype of it that we are implementing at Bloomberg.
Speakers
Jane Zhu

Senior Software Engineer, Bloomberg
Dr. Jane Zhu is a Senior Software Engineer in the Storage Engineering team at Bloomberg. Jane and her team designed and built a highly available, scalable, and durable software-defined cloud storage platform inside the Bloomberg ecosystem. Jane worked in the industry for more than... Read More →
Wednesday December 4, 2024 11:40 - 12:15 CET
SG Auditorium C
  Session Presentation
  • Audience Level Any

11:40 CET

Pull Requests and Reviews for Good - Gregory Farnum & Sam Just, IBM
Wednesday December 4, 2024 11:40 - 12:15 CET
You’ve built an amazing new feature for your Ceph use case and want to share it with the world. Now what? You make a pull request! Learn how to prepare and present your new work effectively and successfully within the Ceph community, and understand what you can expect from other contributors who are reviewing it. 
Conversely, you are asked to review a PR, and maybe it needs work. How do you share feedback in a way that it’s heard and handled? Which feedback is worth sharing at which stage of the process? And what are you promising to everybody else when you provide that “Reviewed-by” tag? This talk is for anybody who writes or reviews code (or wants to!) in the Ceph project. Prepared and presented by two of Ceph’s original four tech leads, hear about pitfalls and tips developed submitting PRs, mentoring new developers, and reviewing code from drive-by contributors, Ceph startups, consultancies, and future maintainers!
Speakers
Sam Just

Engineer, IBM
Sam began working on the Ceph project in 2011. Most of his time currently is spent working on crimson, the next generation ceph-osd implementation.
Gregory Farnum

CephFS Engineering Manager, IBM
Greg Farnum has been in the core Ceph development group since 2009. Greg has contributed major work to CephFS and RADOS, contributed foundational work in the early days of RBD and RGW, previously served as the CephFS tech lead, and now manages IBM’s CephFS development team while... Read More →
Wednesday December 4, 2024 11:40 - 12:15 CET
SG Auditorium B

11:40 CET

The SMB Report Card - John Mulligan, IBM
Wednesday December 4, 2024 11:40 - 12:15 CET
Is 2024 the year of SMB on Ceph? It was for me. In this talk I will discuss the progress of our effort to add managed SMB support to Ceph. Focusing primarily on the orchestration aspects of this work, I will talk about some of the major steps it took to integrate Samba with Ceph, some of the projects outside Ceph that help make it happen, and some of our future plans. We will look into the commands needed to set up an SMB Cluster and Shares and demonstrate the workflow involved in connecting some Windows and Linux clients to an SMB Share using Active Directory authentication.
Speakers
John Mulligan

Software Developer, IBM
John Mulligan is a developer at IBM working on the Ceph team. John's current focus is adding SMB support to Ceph. In addition to SMB/Samba John is interested in topics including Containers, Python, and Orchestration.
Wednesday December 4, 2024 11:40 - 12:15 CET
SG Auditorium A

13:45 CET

Session To Be Announced - Nathan Goulding, Vultr
Wednesday December 4, 2024 13:45 - 14:00 CET
Speakers
Nathan Goulding

Senior Vice President, Engineering, Vultr
Nathan Goulding is an entrepreneurial-minded, product-focused technical leader with over 20 years of infrastructure, platform, and software as-a-service experience. As SVP, Engineering at Vultr, Nathan leads the engineering and technical product management teams. Prior to Vultr, Nathan... Read More →
Wednesday December 4, 2024 13:45 - 14:00 CET
SG Auditorium C

13:45 CET

Bringing a Ceph Based Enterprise Email System Into the Field - Danny Al-Gaaf, Deutsche Telekom AG
Wednesday December 4, 2024 13:45 - 14:20 CET
Deutsche Telekom operates a growing email system with several million accounts and billions of emails stored on traditional NFS. Six years ago we introduced librmb (librados mailbox) to the community, a universal open source library to store emails in a Ceph cluster. Librmb uses RADOS to store email directly in Ceph to achieve maximum performance through parallel access from many email gateways simultaneously, for millions of active customers. Email systems are much too complex to be simulated in a way that would allow us to verify whether the switch to librmb will work for a large number of users. Therefore a field test with active customers was necessary to provide an educated guess on the behavior of the final setup. This presentation will cover the results from artificial and real field tests with more than 1 million accounts/users. The results include the experience and learnings from migrating from the existing email system into Ceph, from running the system for an extended time, and from migrating the accounts out of the test system. We will provide insight into our learnings, the issues we found, potential solutions, and an outlook on our next steps towards a Ceph-based email system.
Speakers
Danny Al-Gaaf

Senior Cloud Technologist, Deutsche Telekom AG
Danny Al-Gaaf is a Senior Cloud Technologist working for Deutsche Telekom. As a Ceph upstream developer he is a driver for using Ceph at Deutsche Telekom. For the last 15 years his professional focus has been on Linux and open source. He works actively in several upstream communities... Read More →
Wednesday December 4, 2024 13:45 - 14:20 CET
SG Auditorium A

13:45 CET

Crimson Project Update - Matan Breizman & Aishwarya Mathuria, IBM
Wednesday December 4, 2024 13:45 - 14:20 CET
The Crimson project is an effort to build a replacement ceph-osd well suited to the new reality of low latency, high throughput, persistent memory and NVMe technologies. Built on the Seastar C++ framework, crimson-osd aims to be able to fully exploit modern devices by minimizing latency, CPU overhead, and cross-core communication. This talk will discuss the current state of Crimson going into the Tentacle release.
Speakers
Aishwarya Mathuria

Senior Software Engineer, IBM
Matan Breizman

Crimson's Tech Lead, IBM
Matan is the tech lead of Crimson, the next-generation Ceph OSD. He has been part of the core Ceph development group since 2021.
Wednesday December 4, 2024 13:45 - 14:20 CET
SG Auditorium B

14:25 CET

Conditional End2end Tracing - Yuval Lifshitz, IBM & Deepika Upadhyay, Clyso GmbH
Wednesday December 4, 2024 14:25 - 15:00 CET
End-to-end tracing can help debug latency issues between the RGW and the OSD, giving a complete picture of request flow, but tracing itself has a performance impact on the system. When your production system runs into issues, the last thing you want is to put it under more pressure! In this session we will demonstrate how to use Lua scripting on the RGW to turn on OpenTelemetry-based tracing for only some of the incoming requests, allowing us to focus on the problem without slowing down the entire system.
Speakers
Yuval Lifshitz

Senior Technical Staff Member, IBM
Yuval Lifshitz works as a Senior Technical Staff Member at IBM. His current focus is enriching the Ceph ecosystem by adding connectivity between the Rados Object Gateway and external message brokers (Kafka, Knative, RabbitMQ, etc.). He also added Lua scripting into the Rados Object... Read More →
Deepika Upadhyay

Ceph Engineer, Clyso GmbH
Deepika is currently working as a Ceph Engineer at Clyso GmbH and is a contributor to the Ceph and Rook projects. She worked as an Outreachy intern for Ceph with a focus on adding tracing to the Ceph OSD. She has also worked with the RADOS and RBD (block-based storage) teams and is now working with... Read More →
Wednesday December 4, 2024 14:25 - 15:00 CET
SG Auditorium B

14:25 CET

The Art of Teuthology - Patrick Donnelly, IBM, Inc.
Wednesday December 4, 2024 14:25 - 15:00 CET
The Ceph project has used the Teuthology testing framework for much of its history. The custom framework is used to schedule batch jobs that perform e2e testing of Ceph. This is orchestrated using a suite of YAML fragments to alternate test modes, configurations, workloads, and other parameters. Teuthology assembles these fragments into a static matrix with potentially dozens of dimensions, ultimately producing a combinatoric explosion of jobs which are evaluated, in practice, as smaller subsets for scheduling. We will explore an alternative directed graph model for constructing jobs from a suite of YAML fragments using path walks. Code adapted to this model has been constructed to produce subsets in linear time and provide Lua-scriptable control of YAML fragment generation. The latter new feature empowers us to test Ceph with more rigor and completeness. For example, upgrade suites can be constructed using all possible versions of Ceph that are valid upgrade paths to a target release. We will explore this and other enhancements in depth. The audience can expect to leave with a firm and visual understanding of how QA is performed on Ceph and a vision for future testing.
Speakers
Patrick Donnelly

Software Architect, IBM, Inc.
Patrick Donnelly is a Software Architect at IBM, Inc. working as part of the global development team on the open source Ceph distributed storage system. Patrick has principally worked on the Ceph file system (CephFS) since 2016. He has been working on Open Source projects for the... Read More →
Wednesday December 4, 2024 14:25 - 15:00 CET
SG Auditorium C

15:05 CET

Exploring RocksDB in RGW: How We Manage Tombstones - Sungjoon Koh, LINE Plus
Wednesday December 4, 2024 15:05 - 15:40 CET
LINE, a global mobile messenger, has adopted Ceph as its main object storage. It is used to store different kinds of data, such as log files and application data. Thanks to its scalability, billions of objects are stored in our clusters. However, over time, object deletions lead to the accumulation of tombstones in RocksDB, resulting in delays during iteration. Slow iteration not only impacts the LIST operation but also stalls subsequent requests. To address this issue, we first collected a RocksDB metric called "skip count", which indicates the total number of tombstones detected during iterations. We then deployed a new job which compacts OSDs with high skip counts to prevent stalls. Additionally, we analyzed the pattern of tombstones and found that a few prefixes account for over 80% of the tombstones across the entire OSD. Based on this observation, we propose range-based compaction. In this presentation, we will first explain the basics of RocksDB and its role in Ceph Object Storage. Then, we will share how we handled the RocksDB issue. Lastly, we will discuss our proposal for range-based compaction, which could further optimize overall system performance.
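As a rough illustration of the two ideas in this abstract (picking OSDs to compact by skip count, and finding the few key prefixes that hold most tombstones), here is a minimal Python sketch. It is not LINE's implementation: the function names, the threshold, the fixed prefix length, and the way skip counts and tombstone keys are gathered are all assumptions made for the example.

```python
from collections import Counter


def osds_to_compact(skip_counts, threshold):
    """Return OSD ids whose skip count exceeds `threshold`, worst first.

    `skip_counts` maps OSD id -> tombstones skipped during iteration
    (hypothetical input; how the metric is scraped is out of scope).
    """
    hot = [osd for osd, count in skip_counts.items() if count > threshold]
    return sorted(hot, key=lambda osd: skip_counts[osd], reverse=True)


def dominant_prefixes(tombstone_keys, prefix_len, coverage=0.8):
    """Return the most frequent key prefixes that together cover at least
    `coverage` of all tombstones (candidates for range-based compaction)."""
    counts = Counter(key[:prefix_len] for key in tombstone_keys)
    total = sum(counts.values())
    chosen, covered = [], 0
    for prefix, count in counts.most_common():
        if covered >= coverage * total:
            break
        chosen.append(prefix)
        covered += count
    return chosen
```

A scheduler could feed the first result to whatever compaction mechanism the operator uses, and the second to a range-limited compaction if one is available.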
Speakers
Sungjoon Koh

Cloud Storage Engineer, LINE Plus
Sungjoon Koh is a cloud storage engineer at LINE Plus Corporation, focusing on object storage and NVMe-oF-based block storage services. His current interests include enhancing Ceph's compatibility with the S3 standard and developing object migration features. Before joining LINE Plus... Read More →
Wednesday December 4, 2024 15:05 - 15:40 CET
SG Auditorium C

15:05 CET

RBD in Squid and Beyond - Ramana Krisna Venkatesh Raja, IBM Canada Ltd & Prasanna Kumar Kalever, IBM
Wednesday December 4, 2024 15:05 - 15:40 CET
This talk will provide an overview of the new features and notable improvements in Ceph's block device component, RBD, in the Squid release. We will discuss topics such as the new feature to mirror RBD groups, improvements in live-migrating RBD images, various performance optimizations in RBD, and improved support for Windows. The session will also cover what's next for RBD in the Tentacle release. The goal is to keep new and experienced RBD users up-to-date with the latest that RBD has to offer and future plans for RBD.
Speakers
Prasanna Kumar Kalever

Software Architect, IBM
Prasanna Kumar Kalever works as a Software Architect at IBM and is a member of the Ceph RBD team. An ex-Red Hatter, he authored block storage support on Gluster, which kick-started Red Hat's OpenShift Data Foundation, and was also instrumental in its integration with Kubernetes. His contributions include... Read More →
Ramana Raja

Senior Software Engineer, IBM Canada Ltd
I am a developer working on Ceph's RBD component with a focus on RBD mirroring. I have made numerous code contributions to the RBD and CephFS components of the Ceph project. I was previously the maintainer of the CephFS driver for the OpenStack Manila project. I've also contributed... Read More →
Wednesday December 4, 2024 15:05 - 15:40 CET
SG Auditorium A

16:00 CET

Ceph at 20 Years! Still the Best for Modern Storage - Dan van der Ster, CLYSO
Wednesday December 4, 2024 16:00 - 16:35 CET
This talk explores why Ceph is the best software-defined storage solution available, highlighting its evolution since 2004 and its leadership over today's alternatives. Ceph uses innovative technologies to stay relevant. It offers block, object, and file storage through a unified system, reducing complexity and management overhead. CRUSH and Placement Groups provide scalability and resilience, allowing Ceph clusters to span hardware generations without disruptive migrations. BlueStore enhances performance with flexible replication and erasure coding. Stretch clusters and mirroring enable robust disaster recovery. Scale-out metadata in RGW and CephFS support performance for AI workloads. Ceph remains trendy with top integrations for on-premises cloud platforms, thanks to its pluggable architecture and community contributions. It's free, open-source, vendor-free, and easily installable with orchestration tools. With competition closing in, Ceph's community must innovate to stay ahead. We'll offer insights from today's toughest storage requirements, suggesting technical evolutions for the OSD, RGW, and MDS to keep Ceph at the forefront of software-defined storage solutions.
Speakers
Dan van der Ster

CTO, CLYSO
Dan is CTO for CLYSO, developing and supporting solutions with Ceph, open infrastructure, and cloud native products and services. Dan contributes to the open source Ceph Foundation and community as Executive Council Member since 2021 and Board Member since 2015. Previously Dan was... Read More →
Wednesday December 4, 2024 16:00 - 16:35 CET
SG Auditorium A
  Session Presentation
  • Audience Level Any

16:00 CET

Revisiting Ceph's Performance After 4 Years - Wido den Hollander, Your.Online
Wednesday December 4, 2024 16:00 - 16:35 CET
As new generations of hardware become available and Ceph is improved, how does its performance change? If we look back 4 years, how did Ceph's performance improve (or not)?
Speakers
Wido den Hollander

CTO, Your.Online
Wido has been a part of the Ceph community for over 10 years. Long time user, developer and advocate of the future of storage. He has worked as Ceph consultant and trainer and is now CTO of Your.Online, a European-based hosting group with companies throughout Europe and a large Ceph... Read More →
Wednesday December 4, 2024 16:00 - 16:35 CET
SG Auditorium C

16:40 CET

Ceph Manager Module Design and Operation, an in-Depth Review - Brad Hubbard, Red Hat & Prashant Dhange, IBM Canada Ltd.
Wednesday December 4, 2024 16:40 - 17:15 CET
This session will cover the overall Ceph Manager design and operational aspects of the Ceph MGR daemon. We will begin by giving an introduction to the MGR architecture, then move on to discussing the functionality of the mgr DaemonServer, mgr client, Python module registry, base mgr module, and the loading and unloading of mgr modules. We will then discuss module debugging, an example of GIL deadlock debugging, and how to troubleshoot MGR bugs and plugin issues. Finally, we discuss new features, including tracking mgr ops, and further improvements planned for future releases.
Speakers
Prashant Dhange

Ceph rados core engineer, IBM Canada Ltd.
With 15+ years of experience in storage and cloud computing, Prashant is an experienced professional with a strong background in system programming. Prashant's focus lies in developing and optimizing storage solutions, particularly through his in-depth work with Ceph RADOS, a pivotal... Read More →
Brad Hubbard

Principal Software Engineer, Red Hat
Involved in supporting and contributing to the Ceph project for well over ten years. Most recently as a RADOS core engineer working on features and bugs, both upstream and down, as well as advocating for the customer and expediting their issues internally. I have a passion for complex... Read More →
Wednesday December 4, 2024 16:40 - 17:15 CET
SG Auditorium C

16:40 CET

Maximizing the Value of Your Rados Gateway with Ingress Strategies - Michaela Lang, Red Hat & Daniel Parkes, IBM
Wednesday December 4, 2024 16:40 - 17:15 CET
Drawing from the insights gained from attending Ceph Days 2022, customers' previous talks, and my personal experience with ServiceMesh deployments, I have observed that many organizations need help implementing cluster-wide rate limiting and metrics visibility across their RGW and buckets. We discover a range of exciting use cases as we explore Envoy's capabilities in rate limiting, filtering, and collecting metrics on S3 activities executed against RGW. These include header-based filtering for multi-region deployments, additional OAuth token enforcement, and the ability to easily monitor user-to-bucket metrics without post-processing command outputs. To illustrate these capabilities, I will lead a hands-on lab demonstration, showcasing Envoy's frontend role for RGW in various use cases:
- Rate limiting S3 requests per client
- Rate limiting S3 requests per region
- Rate limiting S3 requests per user/address/bucket (a more granular level of control)
- Utilizing Prometheus for metrics collection and monitoring
- Examining the use of geo-regional traffic flow scenarios in RGW
- Examining the use of traffic stream replication for disaster recovery scenarios in RGW
Speakers
Michaela Lang

Ms, Red Hat
Born in Vienna in 1977, I started with Red Hat Linux 6 in '99 and managed to get my hands on nearly all technologies. I have now landed at Red Hat, where I even get paid for doing things I love to do.
Daniel Parkes

IBM Storage Ceph Technical Product Manager, IBM
Daniel Parkes has been a die-hard Infrastructure enthusiast for many years with a massive passion for open-source technologies and a keen eye for innovation. Daniel is working in the IBM Storage Ceph Product Management team, focusing on the IBM Storage Ceph Object Storage offering... Read More →
Wednesday December 4, 2024 16:40 - 17:15 CET
SG Auditorium A
  Session Presentation
  • Audience Level Any

16:40 CET

The Challenge of Storing Small Objects on a Large Scale - Luis Domingues & Ján Senko, Proton AG
Wednesday December 4, 2024 16:40 - 17:15 CET
As an online services provider, storage is a critical part of Proton. With customers all around the world exchanging e-mails and backing up their data, the storage stack needs to be accessible 24/7. In this talk we will share the challenges of managing Ceph clusters to serve those customers: how we manage 100 PiB of small objects across 6,000+ OSDs, some experiments we tried with OMAP, and what we do to always be online.
Speakers
Luis Domingues

Storage Engineer, Proton AG
Luis Domingues graduated from HES-SO in distributed IT systems. After a few years at Kudelski Group, he joined Proton, where he now works as a storage engineer.
Ján Senko

Head of Storage, Proton AG
Ján founded the Storage department at Proton, pioneered Ceph, and is responsible for several types of data storage encompassing more than 100 PB of data. Luis is a Ceph Engineer responsible for keeping our production Ceph clusters running smoothly.
Wednesday December 4, 2024 16:40 - 17:15 CET
SG Auditorium B

17:20 CET

Improving Ceph Economics with QAT Hardware Offload - Philip Williams, Canonical
Wednesday December 4, 2024 17:20 - 17:55 CET
Ceph, the world's most popular open source software-defined storage system, has offered storage efficiency features such as block device compression, object compression, and server-side object encryption for a number of releases. However, enabling these features has always come as a trade-off between the additional performance required (in terms of cores/GHz) and the raw storage cost, ultimately driving users away from these features. In this talk we will walk through several different scenarios where Intel's QAT offload is used to enable these features without significant overhead to primary processing, while still yielding greater performance without increasing cost per GB.
Speakers
Philip Williams

Product Manager, Canonical
Philip is a Product Manager at Canonical responsible for Ceph and other storage solutions. He has over 18 years experience in the storage industry, having previously been responsible for storage infrastructure and products at a number of leading technology companies.
Wednesday December 4, 2024 17:20 - 17:55 CET
SG Auditorium B

17:20 CET

Supporting 3 Availability Zones Stretch Cluster - Kamoltat (Junior) Sirivadhna, IBM
Wednesday December 4, 2024 17:20 - 17:55 CET
A Ceph cluster stretched across 3 zones faces a potential scenario where data loss can occur due to unforeseeable circumstances. For example, consider 6 replicas spread across 3 datacenters with a min_size of 3, a setup intended to prevent I/O when only 1 datacenter is available. There is an edge case where a placement group (PG) nevertheless becomes available, due to a lack of safeguarding during the process of temporary PG mappings meant to ensure data availability. This scenario poses a risk when the sole surviving data center accepts writes and then the 2 unavailable data centers suddenly come back up while, at the same time, the surviving data center goes down: the result is data loss. To prevent this, we created a solution that uses an existing feature in stretch mode to restrict how we choose the OSDs that go into the acting set of a PG. This talk will take a deep dive into how this feature is implemented in the latest Ceph upstream, as well as other features that improve the user experience with stretch clusters in the latest Ceph upstream release.
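To make the edge case concrete, here is a small Python sketch of the kind of check the abstract alludes to. It is an assumption-laden illustration, not Ceph's code: the function name, the `min_datacenters` rule, and the OSD-to-datacenter map are hypothetical. It shows why counting OSDs against min_size alone is not enough once a temporary mapping can fill the acting set from a single site.

```python
def acting_set_allowed(acting_osds, osd_datacenter, min_size=3, min_datacenters=2):
    """Hypothetical safeguard: accept an acting set only if it holds at
    least `min_size` OSDs and those OSDs span at least `min_datacenters`
    distinct datacenters."""
    sites = {osd_datacenter[osd] for osd in acting_osds}
    return len(acting_osds) >= min_size and len(sites) >= min_datacenters


# Illustrative layout: dc1 has a third OSD (6) that a temporary mapping
# could pull in when dc2 and dc3 are unreachable.
osd_datacenter = {0: "dc1", 1: "dc1", 6: "dc1",
                  2: "dc2", 3: "dc2",
                  4: "dc3", 5: "dc3"}

single_site = [0, 1, 6]   # meets min_size, but every OSD is in dc1
two_sites = [0, 1, 2]     # meets min_size and spans dc1 and dc2
```

With this layout, a size-only check would let `single_site` serve I/O, whereas the site-aware check rejects it and still accepts `two_sites`.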
Speakers
Kamoltat (Junior) Sirivadhna

Software Engineer RADOS, IBM
Junior has been a Ceph contributor for 4 years. Some of his work includes enhancing Stretch Mode/Stretch Cluster features in Ceph and improving the PG autoscaler module. Furthermore, he also contributes to the enhancement of Teuthology, a Ceph integration testing framework that... Read More →
Wednesday December 4, 2024 17:20 - 17:55 CET
SG Auditorium C
  Session Presentation
  • Audience Level Any
 