David is a Senior AI/ML Engineer within the Office of the CTO at NetApp, where he’s dedicated to empowering developers to build, scale, and deploy AI/ML solutions in production environments. He brings deep expertise in building and training models for applications such as NLP, vision, real-time analytics, and even classifying debilitating diseases. His mission is to help users build, train, and deploy AI models efficiently, making advanced machine learning accessible to users of all levels.

Before NetApp, he was heavily involved in the AI/ML community, specifically in conversational AI solutions and driving AI platform growth in a DevRel and pre-sales role. David frequently shares his insights at industry conferences and events, offering hands-on guidance for implementing AI/ML in cloud environments. David's prior experience includes contributing to the Kubernetes and CNCF ecosystems, working hands-on with VMware virtualization, implementing backup/recovery solutions, and developing firmware and drivers for hardware storage adapters.

Open Source Contributions: http://www.github.com/davidvonthenen
LinkedIn: https://linkedin.com/in/davidvonthenen
YouTube: https://www.youtube.com/@davidvonthenen
Blog: https://davidvonthenen.com

Presentations

23x

The Sound of Your Secrets: Teaching Your Model to Spy, So You Can Learn to Defend

AI can now listen to your keyboard and guess what you're typing. This session shows how deep learning models can reconstruct text from keystroke sounds, then breaks down how these attacks work and how to defend against them. It's a live, hands-on look at the thin line between innovation and exploitation in modern AI security. Bring your curiosity and maybe a little paranoia.

See Presentation
23x

A Practical Guide to Training a Small Language Model: Tokenizers, Training, and Real-World Pitfalls

This session walks through how to build and train your own Small Language Model using open source tools. You'll learn how to select datasets, reuse BERT's tokenizer, design a smaller student model, and apply knowledge distillation to preserve accuracy. We'll share code, show common pitfalls, and help you deploy an efficient, real-world SLM that runs fast on modest hardware.

See Presentation
22x

Training Multi-Modal ML Classification Models for Real-Time Detection of Debilitating Disease

This session breaks down the training process of multi-modal machine learning models for real-time detection of debilitating diseases using video and audio data. Attendees will learn how to build and integrate video and audio classifiers, curate multi-modal datasets, and deploy these models in real-world settings. The session includes practical demonstrations and code samples to help participants implement their own multi-modal classification models, equipping them with tools and methodologies to apply in the healthcare industry or other fields requiring complex real-time detection.

See Presentation
22x

Demystifying Building Natural Language Processing ML Models and How to Leverage Them By Example

This session describes and simplifies the process of building NLP models by guiding attendees through creating a basic model, followed by a more advanced redaction model from scratch. The session offers practical insights into model development, including dataset preparation and model training, while providing code to build and implement the models discussed. By the end of the session, participants will walk away with a proven framework for approaching NLP problems and the tools to apply these concepts immediately in their work.

See Presentation
21x

Voice-Activated AI Collaborators: A Hands-On Guide Using LLMs in IoT & Edge Devices

Discover the power of voice-assisted IoT and Edge interfaces in this practical session. Learn to create effective voice-activated devices, using LLMs like Falcon and OpenLLaMA. Gain insights into design decisions, essential components, and collaborative frameworks.

See Presentation
16x

Application Monitoring and Tracing in Kubernetes: Avoiding Microservice Hell!

Creating and deploying Microservices is easy. The real problem is how to manage and support these services out in the wild and in production. What happens when these services stop working or worse yet when they are running but running slowly? Which service instance is the culprit? This session talks about how you can leverage Jaeger, OpenTracing, and Prometheus in order to give better visibility into the distributed nature of a Microservice architecture in a Kubernetes environment.

See Presentation
15x

How Container Schedulers and Software Defined Storage will Change the Cloud

Persistent applications can be complex to manage and operate at scale, but tend to be perfect for modern schedulers such as Apache Mesos, Kubernetes, and etc. Internal direct attached storage and external storage are both options to run your applications. This talk outlines how 2 Layer Scheduling and Software Defined Storage enables deployment of managed frameworks and tasks, while maintaining high availability, scale-out growth, and automation.

See Presentation