Helpsteer: Multi-attribute helpfulness dataset for steerlm Z Wang, Y Dong, J Zeng, V Adams, MN Sreedhar, D Egert, O Delalleau, ... arXiv preprint arXiv:2311.09528, 2023 | 4 | 2023 |
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment G Shen, Z Wang, O Delalleau, J Zeng, Y Dong, D Egert, S Sun, J Zhang, ... arXiv preprint arXiv:2405.01481, 2024 | | 2024 |