Whisper is an open-source automatic speech recognition (ASR) and speech translation model released by OpenAI in September 2022 under the
RT-2 (Robotics Transformer 2) is a Vision-Language-Action model published by Google DeepMind in 2023 that pioneered the modern VLA paradigm.


