Machine Learning 2 Exploring the gradient noise scale Aug 28, 2025 Multidimensional RoPE Jul 31, 2025