frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Using Microsoft Copilot Enterprise, 80% of time the AI falsified results or code

https://info.microsoft.com/ww-landing-four-paths-to-business-value-with-ai.html?lcid=en-us
4•verhash•1h ago

Comments

verhash•1h ago
Over the past several months I started using the Copilot Enterprise to learn coding as a hobby and help me develop my skills, which Microsoft has advertised on their site how it would be great for business use and we should all adopt agents and use them for all various tasks. I've noticed over say 80% of my projects at least where the agents have falsified code, falsified results, and when confronted about it. They will admit that they did but only when confronted do they offer and promise that they'll fix it, Which of course I'm expected to cover the cost of them fixing their own mistake.

I've tested across all three models that are available on there from Gemini 3.1, Claude 4.8 and ChatGPT 5.5 and all three of them did exactly the same thing just last night. so I thought maybe you know maybe I was doing something wrong. I reached out to Copilot about it which he offered to empower my business with Ai, offered me some paid training course, which I would have to pay for of course.

I have previously reached out to customer service on it and not even gotten offered a refund, no credit, nothing. Now I work in healthcare and one of my greatest concerns is someone using Ai for coding purposes. I believe Ai can be a great tool to help empower people and teach them news skills and hobbies, but what if somebody uses these coding services for medical equipment. Well it beep and green light turned on, ai said it goods, it's good.

Thankfully we do have regulations and quality checks here, but in other places where that might not be such a standard. Even on Microsoft website it advertises, Product coding time cut by 90% NIQ transformed one of its most resource-intensive workflows with Microsoft Foundry, processing 32,000 products in 10 hours instead of weeks while expanding market reach and client delivery. So I'm wondering are other people's experiences with using coding agents and how are they alleviated and mitigated these risks. How is Microsoft ensuring these Risks are mitigated or reduced, but they continue advertising them and promoting them.

Oras•1h ago
is that a wrong url? its a landing page for eBook by MS, has nothing to do with the title

Billionaire's Warning: I'm Selling. The Crash Is Here [video]

https://www.youtube.com/watch?v=32u5T6lO8qk
1•hsnewman•2m ago•0 comments

Claude Code Hints at Fable Return

https://twitter.com/synthwavedd/status/2069813760622043483
1•thedebuglife•2m ago•0 comments

The Trump White House Is over Anthropic CEO Dario Amodei

https://www.wired.com/story/the-trump-white-house-is-over-anthropics-dario-amodei/
1•thedebuglife•3m ago•0 comments

What if some of history's earliest kings were queens?

https://www.nationalgeographic.com/history/article/history-earliest-kings-ur-sumeria
1•bookofjoe•3m ago•0 comments

Route Through Dead Zones. 302x Faster Than Google

https://www.elara-cortex.com/
1•jkuria•4m ago•0 comments

My New Life with the Palantir Chore Coat

https://www.theatlantic.com/technology/2026/06/palantir-chore-coat/687686/
1•jonah•4m ago•0 comments

Anthropic accuses Alibaba of largest distillation attack to date

https://www.cnbc.com/2026/06/24/anthropic-alibaba-distillation-campaign.html
1•paulddraper•4m ago•0 comments

Ask HN: Do you thank your agents when they did a good job?

1•ex-aws-dude•5m ago•0 comments

We're rebuilding financial services with AI

https://arcawealth.ai
1•smalt•9m ago•0 comments

US Supreme Court scales back Roundup cancer lawsuits

https://www.reuters.com/world/us-supreme-court-scales-back-roundup-cancer-lawsuits-2026-06-25/
4•Tomte•12m ago•0 comments

Complete text of carbonised Herculaneum scroll unlocked for first time

https://www.reuters.com/science/complete-text-carbonised-herculaneum-scroll-unlocked-first-time-2...
1•tylerchr•13m ago•0 comments

As banks close accounts, experts point to immigration crackdown

https://www.americanbanker.com/news/as-banks-close-accounts-experts-point-to-immigration-crackdown
5•petethomas•14m ago•0 comments

IBM Debuts First Sub-1 Nanometer Chip Technology

https://newsroom.ibm.com/2026-06-25-ibm-debuts-worlds-first-sub-1-nanometer-chip-technology
3•porridgeraisin•14m ago•0 comments

UK sales of electric vehicles just overtook petrol cars for the first time

https://www.carbonbrief.org/analysis-uk-sales-of-electric-vehicles-just-overtook-petrol-cars-for-...
2•j4mie•14m ago•0 comments

NationStates Turns Twenty (2022)

https://maxbarry.com/2022/11/12/news.html
1•altilunium•16m ago•0 comments

The U.S. last beat screwworm in 1966

https://www.texastribune.org/2026/06/25/texas-screwworm-history-eradication/
2•mzs•17m ago•0 comments

GTA 6 prices by country, $58 in South Korea to $107 in Israel

https://www.shanethegamer.com/esports-news/gta-6-prices-by-country/
2•misbloss•17m ago•0 comments

Gunship: Origins Devlog #1 – Flying the Latest Build [video]

https://www.youtube.com/watch?v=0fWPkThNmzg
1•skibz•17m ago•0 comments

How to Design Arrows

https://pangrampangram.com/blogs/journal/arrows
1•speckx•17m ago•0 comments

Show HN: Clutch – decentralized ride-sharing on-chain (open source)

https://clutchprotocol.io/
1•mehran_mazhar•18m ago•0 comments

Tracking and analyzing 340 AI leaders/blogs to find out what matters

https://brightray.ai/
1•lundbe•18m ago•1 comments

There's One Clear Reason Americans Are Gloomy About A.I

https://www.nytimes.com/2026/06/25/opinion/ai-americans-pessimism.html
1•xbryanx•18m ago•0 comments

Show HN: iOS Apps on Linux

https://github.com/Lore-Hex/QuillUI
1•ljlolel•19m ago•0 comments

Halvar's Guide to Entrepreneurship

https://thomasdullien.github.io/guides/entrepreneurship/
1•nekitamo•19m ago•0 comments

Capabilities.txt

https://www.capabilitiestxt.org/
2•pschmied•21m ago•0 comments

Show HN: Contraband – Ticketed VR cinema for indie and AI-made films

https://contraband.watch
1•phaedrus044•22m ago•0 comments

IBM says it can fit nearly 100B transistors on a chip

https://www.zdnet.com/education/computers-tech/ibm-claims-beyond-nanometer-milestone-with-sub-1-n...
1•CrankyBear•26m ago•0 comments

Why aren't there more AlphaFolds?

https://nkeivan.com/writing/why-no-more-alphafolds
1•nimski•26m ago•0 comments

Go Easy on the Feeds, Reddit

https://openrss.org/blog/go-easy-on-the-feeds-reddit
3•theanonymousone•26m ago•0 comments

The Agent Is Not the Scanner: Making AI Security Agents Better

https://shad0wmazt3r.github.io/ai-security
2•speckx•26m ago•0 comments