AI Region-aware pre-training for open-vocabulary object detection with vision transformers – Google Research Blog
AI Sakana AI Introduces KAME: A Tandem Speech-to-Speech Architecture That Injects LLM Knowledge in Real Time