Some of the default readers will download models for image captioning
Specifically
- PptxReader
- VideoAudioReader
If you don't want to use those readers, either remove those file types from the directory, exclude those file types in the in simple directory reader constructor, or pass in another reader to override the defaults in the simple directory reader constructor (would likely be a custom reader, or something from http://llamahub.ai)