I'm trying to evaluate a Mistral-7B-based model with custom locality and portability data.
For each of the 50 edits I have 6 locality prompts and 2 portability prompts.
How should I arrange the dicts to feed them into an edit function in that case? Will the variable below, passed as portability_inputs, work as intended?
And a technical question: are the metrics calculated after each edit? If so, is there an option to evaluate everything on the final model after all 50 sequential edits?
Thank you :)
Q1:
Your usage is correct; just ensure that the number of items in the prompts and ground_truth lists under each dimension, such as "english" and "polish", is consistent.
You can also check if the number of metrics recorded in the logs matches the number of input prompts.
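As a rough illustration of the consistency check described above, here is a minimal sketch. It assumes each dimension maps to a dict with `prompt` and `ground_truth` lists holding one entry per edit (an entry may itself be a list when one edit has several prompts). The helper `check_inputs` and the placeholder strings are hypothetical and not part of the library.

```python
def check_inputs(inputs, n_edits):
    """Return a list of problems; an empty list means the dict looks consistent."""
    problems = []
    for dim, fields in inputs.items():
        prompts = fields["prompt"]
        truths = fields["ground_truth"]
        # One entry per edit is expected in both lists.
        if len(prompts) != n_edits:
            problems.append(f"{dim}: {len(prompts)} prompt entries, expected {n_edits}")
        if len(truths) != n_edits:
            problems.append(f"{dim}: {len(truths)} ground_truth entries, expected {n_edits}")
        # If an edit carries several prompts, it needs as many ground truths.
        for i, (p, t) in enumerate(zip(prompts, truths)):
            if isinstance(p, list) != isinstance(t, list):
                problems.append(f"{dim}[{i}]: prompt and ground_truth differ in shape")
            elif isinstance(p, list) and len(p) != len(t):
                problems.append(f"{dim}[{i}]: {len(p)} prompts vs {len(t)} ground truths")
    return problems


# Placeholder data for 3 edits (the real case would have 50).
n_edits = 3
portability_inputs = {
    "english": {
        "prompt": [f"english prompt for edit {i}" for i in range(n_edits)],
        "ground_truth": [f"english answer for edit {i}" for i in range(n_edits)],
    },
    "polish": {
        "prompt": [f"polish prompt for edit {i}" for i in range(n_edits)],
        "ground_truth": [f"polish answer for edit {i}" for i in range(n_edits)],
    },
}

print(check_inputs(portability_inputs, n_edits))
```

Running the same check on locality_inputs before editing, and then comparing the number of logged metrics per dimension with these list lengths, should catch most arrangement mistakes.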
Q2:
Unified evaluation after all edits have been applied is not implemented yet, but you can refer to the pseudocode in Continual Editing #220. I will add this feature in the next version. Thank you!
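Until that lands, the general idea of evaluating only on the final model can be sketched as a two-phase loop. This is not the pseudocode from #220 and not the library's API: `apply_edit`, `evaluate`, the `rewrite_acc` key, and the dict standing in for a model are all hypothetical, chosen only to keep the example self-contained.

```python
def edit_then_evaluate(model, requests, apply_edit, evaluate):
    """Apply every edit sequentially, then score all requests on the final model."""
    # Phase 1: sequential editing, no metrics computed in between.
    for request in requests:
        model = apply_edit(model, request)
    # Phase 2: every request is evaluated against the same final model.
    metrics = [evaluate(model, request) for request in requests]
    return model, metrics


# Toy stand-ins: the "model" is just a dict from prompt to answer.
def apply_edit(model, request):
    edited = dict(model)
    edited[request["prompt"]] = request["target_new"]
    return edited


def evaluate(model, request):
    hit = model.get(request["prompt"]) == request["target_new"]
    return {"rewrite_acc": float(hit)}


requests = [
    {"prompt": "capital of A", "target_new": "X"},
    {"prompt": "capital of B", "target_new": "Y"},
    {"prompt": "capital of A", "target_new": "Z"},  # overwrites the first edit
]

final_model, final_metrics = edit_then_evaluate({}, requests, apply_edit, evaluate)
print(final_metrics)
```

In this toy run the first edit scores 0.0 on the final model because the third edit overwrote it, which is exactly the kind of effect that per-edit metrics would not show.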