This is the tech report for a model I helped work on. I'm biased, but it turned out very well.
We essentially let the model learn to retrieve like a human would: Make a first search, read the results, and then make another. This lets the model be vastly better than pre-programmed pipelines. We test this extensively and compare against implementing this with API models (like Sonnet 4.5 and GPT-5.1). SID-1 compares favorably.
Happy to answer any questions or get feedback. First and foremost: Enjoy the read. It's much more detailed than most tech reports.
maxrumpf•1d ago
We essentially let the model learn to retrieve like a human would: Make a first search, read the results, and then make another. This lets the model be vastly better than pre-programmed pipelines. We test this extensively and compare against implementing this with API models (like Sonnet 4.5 and GPT-5.1). SID-1 compares favorably.
Happy to answer any questions or get feedback. First and foremost: Enjoy the read. It's much more detailed than most tech reports.