The number of training examples processed together in one forward/backward pass during model training. Affects training speed, memory usage, and model convergence behavior.