Well, that data loader config is a problem, but it’s not what’s causing the first error you posted. The error that you said was on
items = json.loads(line). That’s when you’re trying to load your data file inside the dataset reader. Your data file is apparently not formatted as correct json.
You also have an issue with your data loader config. We’ll have an upgrade guide for 1.0 posted soon, but here’s the relevant part for the data loaders:
Iterators ➔ DataLoaders
Allennlp now uses PyTorch’s API for data iteration, rather than our own custom one. This means that
validation_iterator arguments to the
Trainer have been removed and replaced with
Previous config files which looked like:
"sorting_keys": [["tokens"], ["num_tokens"]],
// sorting keys are no longer required! They can be inferred automatically.