Training or tagging performance needs to be validated somehow. I think a good sanity check would be to verify that the ratio of poor-quality data is similar across the different filters.
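A minimal sketch of how such a check could look, assuming each sample is available as a `(filter_name, is_poor)` pair. The function names, the filter names, the counts, and the 0.10 tolerance are all hypothetical and chosen only for illustration; they are not taken from the actual data.

```python
from collections import Counter
from statistics import median


def poor_quality_ratios(samples):
    """Return {filter_name: fraction of samples flagged as poor quality}."""
    totals = Counter()
    poor = Counter()
    for filter_name, is_poor in samples:
        totals[filter_name] += 1
        if is_poor:
            poor[filter_name] += 1
    return {name: poor[name] / totals[name] for name in totals}


def suspicious_filters(ratios, tolerance=0.10):
    """Return filters whose ratio deviates from the median ratio by more than `tolerance`."""
    typical = median(ratios.values())
    return sorted(
        name for name, ratio in ratios.items() if abs(ratio - typical) > tolerance
    )


# Made-up counts, purely illustrative: 100 samples per hypothetical filter.
samples = (
    [("filter_a", True)] * 5 + [("filter_a", False)] * 95
    + [("filter_b", True)] * 6 + [("filter_b", False)] * 94
    + [("filter_c", True)] * 40 + [("filter_c", False)] * 60
)

ratios = poor_quality_ratios(samples)
flagged = suspicious_filters(ratios)
print(ratios)
print(flagged)
```

The median is used as the reference point so that a single outlying filter does not shift the baseline it is compared against. A fixed tolerance is a crude criterion; with small per-filter sample sizes, a proper statistical test on the counts would be more appropriate.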
This distribution between filters looks fishy: