Commit 835ed29

reflection-tuning dataset generation (#349)
1 parent 8ad50a3 commit 835ed29

7 files changed

+1077
-4
lines changed

README.md

Lines changed: 2 additions & 1 deletion
@@ -122,7 +122,8 @@ Several folders contain optional materials as a bonus for interested readers:
 - **Chapter 7:**
 - [Dataset Utilities for Finding Near Duplicates and Creating Passive Voice Entries](ch07/02_dataset-utilities)
 - [Evaluating Instruction Responses Using the OpenAI API and Ollama](ch07/03_model-evaluation)
-- [Generating a Dataset for Instruction Finetuning](ch07/05_dataset-generation)
+- [Generating a Dataset for Instruction Finetuning](ch07/05_dataset-generation/llama3-ollama.ipynb)
+- [Improving a Dataset for Instruction Finetuning](ch07/05_dataset-generation/reflection-gpt4.ipynb)
 - [Generating a Preference Dataset with Llama 3.1 70B and Ollama](ch07/04_preference-tuning-with-dpo/create-preference-data-ollama.ipynb)
 - [Direct Preference Optimization (DPO) for LLM Alignment](ch07/04_preference-tuning-with-dpo/dpo-from-scratch.ipynb)
 

ch07/05_dataset-generation/README.md

Lines changed: 2 additions & 1 deletion
@@ -1,6 +1,7 @@
-# Generating a Dataset for Instruction Finetuning
+# Generating Datasets for Instruction Finetuning
 
 This folder contains utility code that can be used for generating a dataset for instruction finetuning.
 
 - [llama3-ollama.ipynb](llama3-ollama.ipynb): A notebook that creates a synthetic instruction finetuning dataset using Llama 3 and Ollama
 
+- [reflection-gpt4.ipynb](reflection-gpt4.ipynb): A notebook that implements an instruction dataset refinement step based on reflection-tuning
Lines changed: 4 additions & 0 deletions
@@ -0,0 +1,4 @@
+{
+"OPENAI_API_KEY": "sk-...",
+"_comment": "Enter your API key from https://platform.openai.com/api-keys"
+}
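
The new file above is a JSON template holding an OpenAI API key. As a minimal sketch of how a notebook might read such a file: the function name `load_api_key` and the path passed to it are hypothetical, since the diff does not show the file name or how the key is consumed.

```python
import json


def load_api_key(config_path):
    """Read the OPENAI_API_KEY entry from a JSON config file (hypothetical helper)."""
    with open(config_path, "r", encoding="utf-8") as f:
        config = json.load(f)

    api_key = config["OPENAI_API_KEY"]

    # The committed template only contains the placeholder "sk-...",
    # so fail early if the user has not filled in a real key.
    if api_key == "sk-...":
        raise ValueError("Replace the placeholder API key in the config file")
    return api_key
```

Failing early on the placeholder value is one possible design choice; it surfaces a missing key before any API request is attempted.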

ch07/05_dataset-generation/llama3-ollama.ipynb

Lines changed: 1 addition & 1 deletion
@@ -498,7 +498,7 @@
 "name": "python",
 "nbconvert_exporter": "python",
 "pygments_lexer": "ipython3",
-"version": "3.10.6"
+"version": "3.11.4"
 }
 },
 "nbformat": 4,
