This is a multi-domain German-English parallel dataset introduced in Aharoni and Goldberg (2020). It is a new split of the data that avoids duplicate examples and leakage from the train split into the dev/test splits. The original multi-domain data first appeared in Koehn and Knowles (2017) and consists of five datasets available on the OPUS website.
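
As an illustration of what "no duplicates and no train-to-dev/test leakage" means in practice, here is a minimal Python sketch of one way such a property could be enforced on sentence pairs. It is not the procedure used by the dataset authors. The function names (`normalize`, `deduplicate`, `remove_leakage`) and the whitespace-only normalization are assumptions made for this example.

```python
from typing import Iterable, List, Set, Tuple

# A sentence pair: (German source, English target).
Pair = Tuple[str, str]


def normalize(pair: Pair) -> Pair:
    """Collapse runs of whitespace so trivially different copies compare equal.

    Assumption: whitespace-only normalization; a real pipeline might also
    lowercase or strip punctuation.
    """
    src, tgt = pair
    return (" ".join(src.split()), " ".join(tgt.split()))


def deduplicate(pairs: Iterable[Pair]) -> List[Pair]:
    """Keep the first occurrence of each normalized pair, preserving order."""
    seen: Set[Pair] = set()
    unique: List[Pair] = []
    for pair in pairs:
        key = normalize(pair)
        if key not in seen:
            seen.add(key)
            unique.append(pair)
    return unique


def remove_leakage(train: Iterable[Pair], held_out: Iterable[Pair]) -> List[Pair]:
    """Drop every train pair that also appears in the dev/test (held-out) data."""
    held_keys = {normalize(p) for p in held_out}
    return [p for p in train if normalize(p) not in held_keys]
```

For example, calling `remove_leakage(deduplicate(train), dev + test)` yields a train list with no repeated pairs and no pair shared with the held-out data, under the exact-match criterion assumed above.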