Subject: AI DevWorld: AI/ML Engineering Conference
Thursday, February 20
 

10:00am PST

[Virtual] OPEN Session: Developing the Most Efficient LLM Inferencing
Thursday February 20, 2025 10:00am - 10:25am PST
Moshe Twitto, Pliops, Founder & CTO

Organizations are increasingly concerned about the lack of power budgets in data centers, particularly as AI infrastructure and emerging AI applications lead to higher energy footprints and strain cooling systems. As they scale their AI operations and add GPU compute tiers, the escalating power and cooling demands, coupled with significant capital investments in GPUs, are eroding margins. A monumental challenge looms as data centers struggle to secure essential power, creating significant pressure for companies striving to expand their AI capabilities.

In LLM inference today, GPU prefill operations are heavily compute-bound and largely determine the usable batch size. Prefill can fully utilize GPU resources, so increasing the batch size beyond a certain point only increases the Time to First Token (TTFT) without improving the prefill rate. GPU decode operations, by contrast, are HBM bandwidth-bound and mainly influenced by model and KV cache sizes; they benefit significantly from larger batch sizes through higher HBM bandwidth efficiency. Pliops' solution improves prefill time, allowing larger batch sizes without violating the user SLA for prefill operations. Decode performance benefits in turn, since it gains greatly from the increased batch size. As a result, improving prefill time yields nearly proportional improvements in end-to-end throughput.
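
The batch-size argument above can be illustrated with a toy roofline-style model. This is a minimal sketch, not Pliops' method or measured data: the `ToyGpu` figures, the model and KV cache sizes, the 0.5 s TTFT SLA, and the `speedup` factor are all hypothetical values chosen only to show the trend. It assumes prefill time grows linearly with batch size (compute-bound) and that each decode step must stream the model weights plus every sequence's KV cache from HBM (bandwidth-bound).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyGpu:
    """Hypothetical accelerator; both figures are illustrative assumptions."""
    flops_per_s: float       # sustained compute throughput
    hbm_bytes_per_s: float   # sustained HBM bandwidth


def prefill_seconds(batch, prompt_tokens, flops_per_token, gpu, speedup=1.0):
    # Compute-bound: time scales linearly with the total tokens in the batch.
    # `speedup` is an assumed factor standing in for any prefill acceleration.
    return batch * prompt_tokens * flops_per_token / (gpu.flops_per_s * speedup)


def max_batch_under_sla(ttft_sla_s, prompt_tokens, flops_per_token, gpu, speedup=1.0):
    # Largest batch whose prefill still finishes within the TTFT budget.
    per_request = prefill_seconds(1, prompt_tokens, flops_per_token, gpu, speedup)
    return max(1, int(ttft_sla_s / per_request))


def decode_tokens_per_s(batch, weight_bytes, kv_bytes_per_seq, gpu):
    # Bandwidth-bound: each step reads the weights once plus every KV cache,
    # and produces one token per sequence in the batch.
    step_s = (weight_bytes + batch * kv_bytes_per_seq) / gpu.hbm_bytes_per_s
    return batch / step_s


# Illustrative numbers only (assumptions, not vendor data).
gpu = ToyGpu(flops_per_s=1e15, hbm_bytes_per_s=3e12)
PROMPT_TOKENS = 2000
FLOPS_PER_TOKEN = 1.4e10   # roughly 2 FLOPs per parameter for a 7B model
WEIGHT_BYTES = 1.4e10      # 7B parameters at 2 bytes each
KV_BYTES_PER_SEQ = 1e8     # assumed KV cache footprint per sequence
TTFT_SLA_S = 0.5

base_batch = max_batch_under_sla(TTFT_SLA_S, PROMPT_TOKENS, FLOPS_PER_TOKEN, gpu)
fast_batch = max_batch_under_sla(TTFT_SLA_S, PROMPT_TOKENS, FLOPS_PER_TOKEN, gpu, speedup=4.0)

base_tps = decode_tokens_per_s(base_batch, WEIGHT_BYTES, KV_BYTES_PER_SEQ, gpu)
fast_tps = decode_tokens_per_s(fast_batch, WEIGHT_BYTES, KV_BYTES_PER_SEQ, gpu)

print(f"baseline: batch={base_batch}, decode tokens/s={base_tps:.0f}")
print(f"4x faster prefill: batch={fast_batch}, decode tokens/s={fast_tps:.0f}")
```

In this toy model, a faster prefill raises the largest batch that fits the TTFT budget, and decode throughput rises with it. The gain stays close to proportional only while the weight traffic dominates the per-step HBM reads; as the KV cache share grows, the benefit flattens.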
Speakers

Moshe Twitto

Founder & CTO, Pliops
Moshe is the CTO and co-founder of Pliops and an expert in advanced data management and coding algorithms. Prior to co-founding Pliops, Moshe served as CTO of Samsung’s SSD Controller Development Center in Israel. He holds MSEE and BSEE degrees from Technion University, Summa Cum Laude...
VIRTUAL AI DevWorld OPEN Stage

10:30am PST

[Virtual] PRO Session: AI Frontiers: Shielding Digital Gateways from Bot Invasions
Thursday February 20, 2025 10:30am - 10:55am PST
Parth Shukla, Cequence Security, Security Analyst
Khyati Ganatra, Cequence Security, Manager, Applied ML

In the presentation titled "AI Frontiers: Shielding Digital Gateways from Bot Invasions," we delve into the forefront of cyber defense against bot-driven threats that exploit API vulnerabilities. This comprehensive study explores how advanced AI and ML models are being harnessed to fortify digital defenses, offering a detailed analysis of API communication patterns and the evolving landscape of bot attacks. Through a series of real-world case studies, we illuminate the mechanisms of sophisticated bot strategies—ranging from data breaches and account takeovers to shopping bots that deplete inventories. The narrative progresses to unveil how AI/ML technologies serve as the cornerstone of innovative defense mechanisms. We dissect the architecture of AI-driven systems tailored to detect and counteract anomalous behaviors indicative of bot activities, leveraging vast datasets to train ML models that adeptly differentiate between legitimate user interactions and malicious bot intrusions. The discussion further navigates through the technical and operational nuances of implementing AI/ML defenses, emphasizing predictive analytics for preemptive action, machine learning for dynamic threat adaptation, and the overarching impact of such technologies in securing digital ecosystems against the insidious threats posed by automated attacks. This presentation not only highlights the challenges but also showcases the resilience and adaptability of AI/ML solutions in the ever-evolving battle against digital villains.
Speakers

Khyati Ganatra

Manager, Applied ML, Cequence Security
I am deeply fascinated by AI's profound ability to reshape industries and redefine the way we live and work. With a keen interest in the intersection of ML and cybersecurity, I have dedicated my career to developing cutting-edge ML solutions that protect organizations from malicious...

Parth Shukla

Security Analyst, Cequence Security
Parth Shukla is a cybersecurity analyst at Cequence Security with a great passion for web application security. He is also a bug hunter, community builder, and cybersecurity enthusiast who believes in the quote “security is a myth”.
VIRTUAL AI DevWorld Main Stage

11:00am PST

[Virtual] PRO Session: Reproducible AI with Langchain and lakeFS
Thursday February 20, 2025 11:00am - 11:25am PST
Oz Katz, Treeverse, CTO & Co-Founder

Langchain has become one of the most popular frameworks for building custom generative AI apps powered by LLMs that use retrieval-augmented generation (RAG) to improve results. But like all data products, these applications are only as good as the organizational data fed into them.

In this live session you’ll learn how to build a reproducible AI app pipeline with Langchain and lakeFS, including how to build a RAG chatbot and iteratively tune it for best results using Delta Lake’s temporal versions. You’ll come away with improved methods for data reproducibility in custom AI apps.
Speakers

Oz Katz

CTO & Co-Founder, Treeverse
Oz Katz is the CTO and co-creator of the open source lakeFS project, a platform that delivers resilience and manageability to object-storage-based data lakes. Oz engineered and maintained petabyte-scale data infrastructure at analytics giant SimilarWeb, which he joined...
VIRTUAL AI DevWorld Main Stage
 
