There are two fundamentally different regimens depending of what is the goal: learning or just passing grades. If the goal is learning, the system should help you find the final answer, wherever when it is passing grades the system should gives you something like: this is the minimum viable structure to solve problems of type X, something similar to recipes.
Regimen-dependent prompt architecture
If regimen = LEARNING:
LLM receives: "Never provide a final answer first. Always ask a question that moves thinking forward. If the student asks for a recipe, explain why that would undermine the goal. Reward reasoning traces over correct conclusions."
Outputs look like: "What do you think happens at the boundary here?"
If regimen = PASSING GRADES: LLM receives: "Provide minimal viable structures. Assume the user needs to complete a known problem type efficiently. Prioritize clarity, completeness, and reproducibility over depth. Do not ask exploratory questions unless the user fails twice."
Outputs look like: "For problem type X, the recipe is: (1) identify variables A,B,C; (2) apply formula D; (3) check condition E. Example attached."