A curated collection of multimodal large language models — Auto-Regressive, Diffusion, and Hybrid approaches