Skip to content

Commit 89c5f3a

Browse files
committed
Revert unrelated changes
1 parent a14d31c commit 89c5f3a

4 files changed

Lines changed: 4 additions & 14 deletions

File tree

NOTES.md

Lines changed: 0 additions & 5 deletions
This file was deleted.

rdagent/components/coder/data_science/pipeline/prompts.yaml

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -90,7 +90,6 @@ pipeline_coder:
9090
--------- Feedback to former code ---------
9191
{{ latest_code_feedback }}
9292
The former code contains errors. You should correct the code based on the provided information, ensuring you do not repeat the same mistakes.
93-
Keep the part that already seem correct intact. Avoid modifying them to refrain from introducing new errors.
9493
{% else %}
9594
The former code is correct. You should try to improve the code based on the provided task while not changing the irrelevant parts.
9695
{% endif %}

rdagent/oai/backend/base.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -343,7 +343,7 @@ def _try_create_chat_completion_or_embedding( # type: ignore[no-untyped-def]
343343
if hasattr(e, "message") and (
344344
"'messages' must contain the word 'json' in some form" in e.message
345345
or "\\'messages\\' must contain the word \\'json\\' in some form" in e.message
346-
):
346+
):
347347
kwargs["add_json_in_prompt"] = True
348348
elif hasattr(e, "message") and embedding and "maximum context length" in e.message:
349349
kwargs["input_content_list"] = [

rdagent/scenarios/data_science/dev/prompts.yaml

Lines changed: 3 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -15,9 +15,7 @@ exp_feedback:
1515
- Recommend corrective actions explicitly.
1616
- Set `"Replace Best Result": "no"`.
1717
- Begin your `reasoning` with `[Submission format error]`, clearly stating the issues causing experiment failure.
18-
- If submission passes the submission format check:
19-
- If this is the first valid submission ever, set `"Replace Best Result": "yes"`.
20-
- Otherwise, proceed to Step 2.
18+
- If submission passes, proceed to Step 2.
2119
2220
Step 2: Evaluate Alignment with Competition Requirements (if format correct)
2321
- GOAL: CAREFULLY ANALYZE WHETHER THE EXPERIMENTAL SETUP AND CODE MAY CAUSE MISALIGNMENT BETWEEN VALIDATION AND TEST PERFORMANCE.
@@ -58,8 +56,6 @@ exp_feedback:
5856
Provide detailed and constructive feedback structured as follows:
5957
Example JSON Structure for Result Analysis:
6058
{
61-
"Submission Format Check": "yes or no",
62-
"First Valid Submission": "yes or no",
6359
"Observations": "Clearly summarize current and SOTA ensemble results with exact scores and notable patterns. Limit to no more than three concise, data-focused sentences.",
6460
"Feedback for Hypothesis": Explicitly confirm or refute the hypothesis based on specific data points or performance trends. Limit to two sentences.",
6561
"Evaluation Aligned With Task": "yes or no",
@@ -111,10 +107,10 @@ exp_feedback:
111107
{{ cur_exp.experiment_workspace.all_codes }}
112108
113109
## Feedback of past experiments
114-
{{ feedback_desc or "There has not been any experiments yet." }}
110+
{{ feedback_desc }}
115111
Please refer to these hypotheses and feedback to help you recommend new experiment and hypothesis
116112
117113
Tips:
118-
- Step 1: If submission format has issues, prioritize fixing them before proceeding. If the format is correct and it's the first valid submission ever (there has never been valid submissions in the past), set `"Replace Best Result": "yes"`. If the format is correct and this is not the first valid submission, proceed to Step 2.
114+
- Step 1: If submission format has issues, prioritize fixing them before proceeding.
119115
- Step 2: If evaluation alignment issues are identified (validation approach does not follow competition requirements), address these methodological discrepancies immediately.
120116
- Step 3: If new results significantly worse than SOTA, or repeated hyperparameter adjustments yield no improvement, it might be time to rethink or shift focus.

0 commit comments

Comments
 (0)