Developer
Wednesday, August 31

11:30am PDT

Marathon and Chronos - State of the Art - Matthias Eichstedt & Matthias Veit, Mesosphere
Marathon and Chronos are the core and essential frameworks for most Apache Mesos clusters.
Marathon ensures that long-running tasks keep running in the datacenter and is commonly used to maintain high availability of other frameworks. Chronos is the fault-tolerant cron of the datacenter.
Both frameworks have advanced significantly to leverage the latest features of Apache Mesos, including persistent storage, and to extend the capabilities of the system in areas such as networking and security.
This session will provide an overview of all the new features along with experience from the field and best practices for using these features in production. The session will conclude with a look at the roadmap for each of these projects providing a glimpse into what to expect over the next year.

Matthias Eichstedt

Engineering Manager, Mesosphere
I'm Engineering Manager of the Orchestration team at Mesosphere, based in Hamburg, Germany.

Matthias Veit

Engineering Manager, Mesosphere

Wednesday August 31, 2016 11:30am - 12:20pm PDT
Ballroom B

4:20pm PDT

DC/OS: Considerations and Best Practices for Production Deployments - Ben Lin, Mesosphere
DC/OS makes it easy to quickly set up and run a Mesos cluster, but the default configuration may not be as battle-hardened as one would like or uniquely tailored to meet your requirements and constraints. In this session, Mesosphere consultants will discuss key customer requirements and the architecture decisions that lead to a production-ready, resilient, and scalable DC/OS design. This session will cover Capacity Planning, Monitoring, Physical Host Considerations, Security, Container Data Management, and Backup and Recovery. The session will also cover the unique elements that need to be considered when deploying DC/OS in Production vs. Dev/Test configurations.


Ben Lin

Solutions Architect, Mesosphere
Ben Lin is a Solutions Architect at Mesosphere. Ben works closely with customers to develop strategies for architecting, deploying, and operationalizing DC/OS environments. Previously he worked in the VMware NSBU (Networking & Security Business Unit), focusing on advanced solutions...

Wednesday August 31, 2016 4:20pm - 5:10pm PDT
Ballroom B

5:20pm PDT

Building Highly Available Mesos Frameworks, 2.0 - Neil Conway, Mesosphere
Production-quality Mesos frameworks must be able to continue managing tasks despite unreliable networks and faulty computers. Mesos provides tools to help developers do fault-tolerant task management, but putting these tools together effectively remains something of a black art. This talk will offer practical guidance to framework developers to help them understand how Mesos deals with failures and the tools it provides to enable fault-tolerant frameworks. The talk will also cover new Mesos features that allow framework developers to control how partitioned tasks should be handled. Mesos operators will also benefit from a discussion of exactly how Mesos behaves during network partitions and other failure scenarios.


Neil Conway

Neil Conway is an engineer at Mesosphere, and an Apache Mesos Committer and PMC Member. At Mesosphere, he has worked on a variety of projects, including partition-aware Mesos frameworks and improving support for stateful Mesos frameworks. Prior to joining Mesosphere, he completed...

Wednesday August 31, 2016 5:20pm - 6:10pm PDT
Ballroom A
Thursday, September 1

11:15am PDT

Designing and Evaluating a Distributed Computing Language Runtime - Christopher Meiklejohn, Université catholique de Louvain
Consistency is hard and coordination is expensive. As we move into the world of connected 'Internet of Things' style applications, or large-scale mobile applications, devices have less power, experience periods of limited connectivity, and operate over unreliable asynchronous networks. This poses a problem with shared state: how do we handle concurrent operations over shared state, while clients are offline, and ensure that values converge to a desirable result without making the system unavailable?

We look at a new programming model, called Lasp. This programming model combines distributed convergent data structures with a dataflow execution model designed for distribution over large-scale applications. This model supports arbitrary placement of processing nodes: this enables the user to author applications that can be distributed across data centers and pushed to the edge.

In this talk, we will focus on the design and evaluation of the Lasp runtime system: a system written in Erlang with a target scale of 10,000 - 20,000 nodes. We will look at the supporting algorithms that assist in achieving this scale, and how we’ve leveraged Mesos and DC/OS to build a system for performing experiments and identifying bottlenecks as we develop the language.
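
As a rough illustration of what a "convergent data structure" means, here is a minimal sketch of a grow-only counter in Python. It is not Lasp's API (Lasp itself is written in Erlang); the `GCounter` class and the replica names are hypothetical and only show how replicas that update independently while offline can still merge to the same value.

```python
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class GCounter:
    """Hypothetical grow-only counter: one slot per replica, merged by max."""

    counts: Dict[str, int] = field(default_factory=dict)

    def increment(self, replica_id: str, amount: int = 1) -> None:
        # Each replica only ever increases its own slot (amount must be >= 0).
        if amount < 0:
            raise ValueError("a grow-only counter cannot decrease")
        self.counts[replica_id] = self.counts.get(replica_id, 0) + amount

    def value(self) -> int:
        # The counter's value is the sum over all replica slots.
        return sum(self.counts.values())

    def merge(self, other: "GCounter") -> "GCounter":
        # Element-wise max is commutative, associative, and idempotent,
        # so replicas converge regardless of merge order or repetition.
        keys = self.counts.keys() | other.counts.keys()
        return GCounter(
            {k: max(self.counts.get(k, 0), other.counts.get(k, 0)) for k in keys}
        )


# Two replicas update independently while disconnected.
a, b = GCounter(), GCounter()
a.increment("a", 2)
b.increment("b", 3)

# Merging in either order yields the same state.
assert a.merge(b) == b.merge(a)
assert a.merge(b).value() == 5
```

Because the merge never needs coordination, no replica has to block while others are unreachable, which is the property the abstract points at.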

Christopher Meiklejohn

Ph.D. candidate, Université catholique de Louvain

Thursday September 1, 2016 11:15am - 12:05pm PDT
Ballroom B

1:30pm PDT

One is not enough - Using Multiple Disk Support - Joris Van Remoortere, Jörg Schad & Felix Hupfeld, Mesosphere
Multiple Disk (MESOS-191) is an age-old feature request that was implemented at last year's MesosCon EU hackathon. It allows operators to expose multiple disk resources. This enables frameworks to a) utilize more than one disk per agent and b) gain exclusive access to a complete disk device (including receiving disk errors when they attempt to exceed the capacity of the volume, instead of having Mesos kill the task when it exceeds its disk resource limit).
This talk will give an overview of the different types of disk resources now available (Root, Path, and Mount Disk) and guidelines on when to use which. Second, we will discuss how multiple disk resources can be utilized by users who are running their workloads using Marathon. Last, we will provide best practices for framework developers who are looking to utilize this feature.

This will be illustrated using the example of the Mesos Quobyte framework, which was one of the first frameworks to take advantage of multiple disk support.
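
To make the three disk types named above more concrete, the following Python sketch builds a resource list of the kind an operator might pass to a Mesos agent. The JSON field names follow the shape described in the Apache Mesos multiple-disk documentation as best recalled here, so treat them as an assumption and check them against the documentation for your Mesos version. The helper `disk_resource`, the sizes, and the `/mnt/...` paths are hypothetical.

```python
import json
from typing import Any, Dict, Optional


def disk_resource(
    size_mb: int, kind: str = "ROOT", root: Optional[str] = None
) -> Dict[str, Any]:
    """Hypothetical helper that describes one disk resource as a dict."""
    resource: Dict[str, Any] = {
        "name": "disk",
        "type": "SCALAR",
        "scalar": {"value": size_mb},
    }
    if kind == "ROOT":
        # Root disk: no explicit source, space comes from the agent's work dir.
        return resource
    if kind not in ("PATH", "MOUNT"):
        raise ValueError(f"unknown disk type: {kind}")
    if root is None:
        raise ValueError("PATH and MOUNT disks need a root directory")
    # PATH disks can be shared and carved up; MOUNT disks are handed out whole.
    resource["disk"] = {"source": {"type": kind, kind.lower(): {"root": root}}}
    return resource


resources = [
    disk_resource(10240),                       # root disk
    disk_resource(2048, "PATH", "/mnt/data1"),  # assumed path
    disk_resource(4096, "MOUNT", "/mnt/data2"), # assumed mount point
]
print(json.dumps(resources, indent=2))
```

A framework that wants exclusive access to a device would look for the `MOUNT` entry, while the `PATH` entry suits several tasks sharing one larger disk.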


Felix Hupfeld

Founder, Quobyte
Felix manages Quobyte’s technology and pushes development forward. Before that, he worked as a tech lead and capacity planner in Google’s infrastructure team (2009–2013). He was the architect and project manager for XtreemFS (2006–2009). Felix’s PhD was on distributed s...

Joris Van Remoortere

Joris Van Remoortere is a Senior Software Engineer at Mesosphere. He focuses on large-scale, low-latency, high-throughput systems. He alleviates scaling issues and makes performance and reliability improvements on many projects, including Orly, messaging and payment systems, web tiers...

Jörg Schad

Head of Machine Learning, ArangoDB
Jörg Schad is Head of Machine Learning at ArangoDB. In a previous life, he built machine learning pipelines in healthcare, worked on distributed systems at Mesosphere and on in-memory databases, and conducted research in the Hadoop and cloud area. He’s a frequent speaker at meetups...

Thursday September 1, 2016 1:30pm - 2:20pm PDT
Ballroom B

2:30pm PDT

Practical, Resilient Software Defined Networking: A Field Report - Sargun Dhillon & Avinash Sridharan, Mesosphere
Mesos, as it comes out of the box, has some limitations in its network infrastructure. The out-of-the-box mechanism does not work with legacy applications that expect stable ports and IPs. We built a general-purpose solution using off-the-shelf components to provide overlay networking for Mesos clusters. This overlay network is meant to become the default way of doing networking in Mesosphere's DC/OS. In this talk, Avinash, Jie, and Sargun will talk about the design for the Mesos modules, isolators, and integration into Docker. This mechanism sits alongside an external distributed system that orchestrates it and glues it all together. The system is meant to be flexible and highly available, and to run in the stable state without any masters. We'll talk about why we decided to go with this design and how it works as deployed in real DC/OS installations.

Avinash Sridharan

Software Engineer, Mesosphere
Love containers, especially networking them! Have worked (and studied) in the field of networking for the past decade or so. Very interested in designing and developing new network technologies. Want everything to be fast, distributed, and extremely efficient.

Thursday September 1, 2016 2:30pm - 3:20pm PDT
Ballroom B