Downloads, formats, and saves the CPT dataset. print("Downloading raw SQL dataset from Hugging Face...") # This dataset contains ~78k highly quality Text-to-SQL pairs ...