#AI optimization
Latest news articles tagged with "AI optimization". Follow the timeline of events, related topics, and entities.
Articles (3)
-
πΊπΈ CATP: Cross-Attention Token Pruning for Accuracy Preserved Multimodal Model Inference
[USA]
arXiv:2404.08567v2 Announce Type: replace-cross Abstract: In response to the rising interest in large multimodal models, we introduce Cross-Attention Token Pruning (CATP), a precision-focused token p...
Related: #Multimodal processing, #Computational efficiency -
πΊπΈ MASPRM: Multi-Agent System Process Reward Model
[USA]
arXiv:2510.24803v2 Announce Type: replace-cross Abstract: Practical deployment of multi-agent systems (MAS) demands strong performance at test time, motivating methods that guide search during infere...
Related: #Multi-agent systems, #Computational efficiency -
πΊπΈ Difficulty-Aware Agentic Orchestration for Query-Specific Multi-Agent Workflows
[USA]
arXiv:2509.11079v5 Announce Type: replace Abstract: Large Language Model (LLM)-based agentic systems have shown strong capabilities across various tasks. However, existing multi-agent frameworks ofte...
Related: #Multi-agent systems, #Computational efficiency