Fine-Tuning an Open-Source LLM with Axolotl Using Direct Preference Optimization (DPO)