High latency on Opus model under load with large context
5
Claude Opus experiences significant latency spikes when processing requests with 200K-token context windows during periods of high load, degrading the responsiveness of real-time applications.
performance, Claude Opus