Great update from the vLLM team: the new plugin system is a big step forward for flexible, upstream-safe LLM serving. This aligns perfectly with ZFLOW AI’s simulation-driven approach.

We need a serving engine that can be customized (scheduling, KV-cache behavior, model variants) without forking, and vLLM’s plugin layer gives us exactly that. It lets us integrate custom optimization logic directly into our simulation stack and deploy the same logic in production through ZFLOW Serve, closing the loop between simulation → optimization → real deployment.

Excited to explore deeper synergy here.

#AIInfrastructure #LLMInference #vLLM #ZFLOWAI
Need to customize vLLM? Don't fork it. 🔌

vLLM's plugin system lets you inject surgical modifications without maintaining a fork or monkey-patching entire modules. Blog by Dhruvil Bhatt from AWS SageMaker 👇

Why plugins > forks:
• vLLM releases every 2 weeks with hundreds of PRs merged
• Forks require constant rebasing and conflict resolution
• Monkey patches break on every vLLM upgrade

How it works (see the sketch after this post):
• Use VLLMPatch[TargetClass] for precise, class-level mods
• Register via the vllm.general_plugins entry point
• Control patches with env vars (VLLM_CUSTOM_PATCHES)
• Version-guard with the @min_vllm_version decorator

Example: add priority scheduling to vLLM's scheduler in ~20 lines. One Docker image serves multiple models with different patches enabled via environment variables.

The plugin loads in ALL vLLM processes (main, workers, GPU/CPU) before any inference starts, ensuring consistent behavior across distributed setups.

Read the full implementation guide with code examples: https://lnkd.in/e4U_xeFa
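To make the mechanism concrete, here is a minimal sketch of what such a plugin package can look like. Only the vllm.general_plugins entry-point group and the VLLM_CUSTOM_PATCHES gating come from the post above; the package layout, the register()/patch function names, and the scheduler internals are hypothetical stand-ins for the blog's VLLMPatch and @min_vllm_version helpers, not vLLM or blog APIs.

```python
# my_vllm_plugin/patches.py
#
# Sketch of a vLLM "general plugin" (hypothetical package, not from the blog).
# Register it so vLLM discovers and runs it in every process, e.g. in
# pyproject.toml:
#
#   [project.entry-points."vllm.general_plugins"]
#   my_patches = "my_vllm_plugin.patches:register"
#
import os
from collections import deque


def register() -> None:
    """Entry-point target: vLLM invokes this in each process before inference.

    Patches are gated by the VLLM_CUSTOM_PATCHES env var (the convention from
    the post), so a single Docker image can enable different patch sets per
    deployment.
    """
    enabled = {
        name.strip()
        for name in os.environ.get("VLLM_CUSTOM_PATCHES", "").split(",")
        if name.strip()
    }
    if "priority_scheduling" in enabled:
        _patch_priority_scheduling()


def _patch_priority_scheduling() -> None:
    """Illustrative class-level patch: order waiting requests by priority.

    The import path, method name, and `priority` attribute are assumptions
    about the vLLM 0.x scheduler; verify them against your vLLM version, or
    use the blog's VLLMPatch / @min_vllm_version helpers, which guard this
    kind of drift.
    """
    from vllm.core.scheduler import Scheduler

    original_schedule = Scheduler._schedule

    def _schedule_with_priority(self, *args, **kwargs):
        # Reorder the waiting queue (lowest priority value first), then defer
        # to the original scheduling logic.
        self.waiting = deque(
            sorted(self.waiting, key=lambda group: getattr(group, "priority", 0))
        )
        return original_schedule(self, *args, **kwargs)

    Scheduler._schedule = _schedule_with_priority
```

In this sketch, a deployment that wants the patch would set something like VLLM_CUSTOM_PATCHES=priority_scheduling in its container environment; deployments that leave the variable unset run stock vLLM behavior from the same image.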