DEVICE/BROWSER INFO
aatventure
In this video, I test Inception's new Mercury 2, a diffusion-based large language model that introduces reasoning capabilities and generates text at 1,000 tokens per second.
I demonstrate its speed and instruction-following through coding tests, and then evaluate its practical utility by integrating it into a real-time voice assistant and my open-source RAG agent.