🤖 AI Summary
This work proposes LAMPS, a multi-agent collaborative system based on large language models (LLMs) to detect malicious packages in the PyPI repository that exhibit strong semantic obfuscation and evade traditional rule-based detection methods. LAMPS employs a modular architecture coordinated via the CrewAI framework, integrating four specialized agents that synergistically combine fine-tuned CodeBERT for classification and LLaMA-3 for contextual reasoning. This approach enables high-precision, interpretable, and scalable supply chain security analysis. Evaluated on the D1 and D2 datasets, LAMPS achieves accuracy rates of 97.7% and 99.5%, respectively, significantly outperforming baseline methods including MPHunter, retrieval-augmented generation (RAG), and single-agent approaches, thereby overcoming the limitations of individual models in recognizing complex malicious behavioral patterns.