Will there be any challenges with scaling if the dataset starts to get really large? | discoverkit