-
Notifications
You must be signed in to change notification settings - Fork 749
Is the code data used in the training data? #112
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
don't think there code data; the two may have been filtered out due to not
having labels
…On Tue, Apr 8, 2025 at 6:10 AM xuhu0115 ***@***.***> wrote:
Hello, may I ask if code data was used in the training data? I noticed
that you mentioned using the LiveCodeBench and USACO code datasets, but
these two labels do not appear in the 59k source_type. How can this be
explained?
—
Reply to this email directly, view it on GitHub
<#112>, or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AO7I55C6U74XIZGGA3LQXAD2YPDCTAVCNFSM6AAAAAB2WIPT4WVHI2DSMVQWIX3LMV43ASLTON2WKOZSHE3TSNZVGIYDIMQ>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
*xuhu0115* created an issue (simplescaling/s1#112)
<#112>
Hello, may I ask if code data was used in the training data? I noticed
that you mentioned using the LiveCodeBench and USACO code datasets, but
these two labels do not appear in the 59k source_type. How can this be
explained?
—
Reply to this email directly, view it on GitHub
<#112>, or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AO7I55C6U74XIZGGA3LQXAD2YPDCTAVCNFSM6AAAAAB2WIPT4WVHI2DSMVQWIX3LMV43ASLTON2WKOZSHE3TSNZVGIYDIMQ>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
Thank you for your reply! Additionally, I have another question. The 59k dataset includes the GPQA dataset. I noticed in the paper that the GPQA Diamond subset was used for evaluation. I’m wondering if the GPQA subset was also used during training? Or were any other subsets besides Diamond used in the training process? |
it's not the diamond subset that is in the training but its other parts (and also additionally decontaminated against the diamond) |
Okay, thank you for your answer! |
Hello, may I ask if code data was used in the training data? I noticed that you mentioned using the LiveCodeBench and USACO code datasets, but these two labels do not appear in the 59k source_type. How can this be explained?
The text was updated successfully, but these errors were encountered: