Uses Apache Tika (https://tika.apache.org/) to extract metadata and text from a variety of file types. It relies on an external Tika server, most commonly deployed ...
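
As a rough illustration of what talking to an external Tika server can look like, here is a minimal sketch using only the Python standard library. It is not this component's actual implementation. It assumes a stock tika-server listening on `http://localhost:9998` (the server's default port) and uses its `/tika` and `/meta` REST endpoints; the helper names `build_tika_request`, `extract_text`, and `extract_metadata` are hypothetical and chosen for this example only.

```python
import json
import urllib.request

# Assumption: a Tika server is reachable at this address.
# 9998 is tika-server's default port; adjust for your deployment.
TIKA_URL = "http://localhost:9998"


def build_tika_request(endpoint, payload, accept, server_url=TIKA_URL):
    """Build (but do not send) a PUT request for a tika-server endpoint.

    `payload` is the raw file content as bytes; `accept` selects the
    response format via the Accept header.
    """
    return urllib.request.Request(
        url=f"{server_url.rstrip('/')}/{endpoint.lstrip('/')}",
        data=payload,
        method="PUT",
        headers={"Accept": accept},
    )


def extract_text(payload, server_url=TIKA_URL, timeout=30):
    """Send file bytes to /tika and return the extracted plain text."""
    req = build_tika_request("tika", payload, "text/plain", server_url)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def extract_metadata(payload, server_url=TIKA_URL, timeout=30):
    """Send file bytes to /meta and return the metadata as a dict."""
    req = build_tika_request("meta", payload, "application/json", server_url)
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With a server running, reading a file in binary mode and passing its bytes to `extract_text(...)` or `extract_metadata(...)` should return the extracted text or a metadata dictionary. The request construction is kept separate from the network call so it can be inspected or tested without a live server.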