🤖 AI Summary
This study systematically evaluates the effectiveness, stability, and cost-efficiency of ChatGPT, Qwen, and DeepSeek, integrated within the OpenFOAMGPT framework, for automating computational fluid dynamics (CFD) workflows, focusing on boundary condition specification, turbulence model configuration, and solver generation.
Method: We propose a large language model (LLM) architecture that combines instruction fine-tuning with tool augmentation; it supports both zero-shot and expert-guided prompting and can be deployed locally (e.g., QwQ-32B) or in the cloud.
Contribution/Results: We empirically demonstrate the fundamental limitations of zero-shot prompting for complex CFD setups: all models exhibit error rates above 40% when generating critical parameters, ruling out unsupervised replacement of human experts. Qwen and DeepSeek match ChatGPT's performance on medium-complexity tasks while substantially reducing inference costs. Our work establishes the first empirical benchmark and a practical deployment roadmap for industrial-grade LLM-augmented CFD automation.
📝 Abstract
We evaluated the performance of OpenFOAMGPT with several large language models. Most of the tested models handle common CFD tasks, such as adjusting boundary conditions, turbulence models, and solver configurations, although their token costs and stability vary. Locally deployed smaller models such as QwQ-32B struggled to generate valid solver files for complex cases. Zero-shot prompting commonly failed in simulations with intricate settings, even for the larger models. Persistent errors in boundary conditions and solver keywords underscore the need for expert supervision, indicating that further development is required before specialized CFD simulations can be fully automated.
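To give a sense of what the models must produce, the sketch below shows a minimal OpenFOAM velocity field dictionary (`0/U`) of the kind involved in the boundary-condition tasks described above. The patch names (`inlet`, `outlet`, `walls`) and values are illustrative assumptions, not taken from the study; a single wrong keyword here (e.g., an invalid `type` entry) is exactly the class of error that forced expert supervision.

```
FoamFile
{
    version     2.0;
    format      ascii;
    class       volVectorField;
    object      U;
}

// SI dimensions of velocity: m/s
dimensions      [0 1 -1 0 0 0 0];

internalField   uniform (0 0 0);

boundaryField
{
    inlet
    {
        type            fixedValue;      // prescribed inflow velocity
        value           uniform (1 0 0);
    }
    outlet
    {
        type            zeroGradient;    // fully developed outflow
    }
    walls
    {
        type            noSlip;          // stationary solid wall
    }
}
```

Every patch declared in the mesh must appear in `boundaryField` with a valid `type` keyword, which is why malformed or missing entries cause immediate solver failures rather than silently degraded results.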