2024-11-26 Star Attention: Efficient LLM Inference over Long Sequences
