We built a custom (experimental) file transformation pipeline into our agent because we don't want to be at the mercy of LLM providers for giving models native access to files. Essentially, we extend the agent to expose files to the LLMs context through a standardized pipeline, allowing to convert any file into a file type specific best representation that only consist out of text, image (and audio/video) parts.