Hacker news

  • Top
  • New
  • Past
  • Ask
  • Show
  • Jobs

MAI-Thinking-1 (https://microsoft.ai)

193 points by LER0ever 4 days ago | 83 comments | View on ycombinator

keeda 4 days ago |

> Second, clean data. MAI-Thinking-1 was trained on clean and appropriately licensed data, with AI-generated content excluded from pre-training. This matters for quality, provenance, and control. If we cannot account for what shaped a model, we cannot fully understand its behavior or credibly improve it.

Shots fired?

It would be interesting to see how far "clean data" can go on the scaling laws.

__natty__ 4 days ago |

It's good there is a new player on the market, I take benchmark tables with a grain of salt, however. Speaking about model presentation it's funny to see how clearly their website is inspired by other AI company blogs with extra innovation of hijacked scrollbar.

jampekka 3 days ago |

The benchmarks are a bit of a disaster? It's at about DeepSeek V3.2 level, but with about 50% more parameters. Loses handily to the also smaller GLM-5.1, and even worse to the similarly sized Kimi K2.6.

pixeldash928 4 days ago |

Looks like the OAI divergence is finally taking place. Seems like the comparisons are mainly with Opus 4.6 and GPT 5.4 though. Still, exciting to see a new frontier player.

Centigonal 4 days ago |

> MAI-Thinking-1 is a 35B-active, ~1T-total parameters, sparse Mixture of Experts model, a smaller inference footprint than much larger models.

This seemingly nonsensical sentence (of course this will have a smaller inference footprint than larger models) suggests this model's competitors have larger inference footprints and total parameter sizes.

Alifatisk 4 days ago |

> MAI-Thinking-1 is built with enterprise readiness in mind. It supports long context with a 256k token window

Isn’t 1M becoming the norm?

aesthesia 3 days ago |

What's interesting is that although they don't seem to be releasing the model weights, they have published a technical report (https://microsoft.ai/wp-content/uploads/2026/06/main_2026060...) that's more extensive than the typical open-weights model gets.

dang 3 days ago |

Related ongoing thread:

MAI-Code-1-Flash - https://news.ycombinator.com/item?id=48374466 - June 2026 (131 comments)

deflator 2 days ago |

Does this mean that work created with it can be copyrighted? Since the courts ruled that the inclusion of pilfered IP was the reason other model's work cannot be copyrighted, I would think so! In that case, this is a completely different beast. It can maybe be used for things that need a durable copyright.

BeetleB 4 days ago |

Based on the first table, why would I pick this over GLM?

lordmauve 4 days ago |

We need to see DeepSWE scores. SWE Bench Pro is junk.

hartator 4 days ago |

I like it so much when a website hijacks the way my scroll works. This is truly innovative.

wmf 4 days ago |

At least there shouldn't be any complaints about benchmaxing this time.

bossyTeacher 4 days ago |

7 modes launched. 5 models in the dropdown. Only 4 actually usable :(

About time Microsoft joined the fray. After the OpenAI divorce, it really looked like Microsoft was going to become another Uber.

kstenerud 4 days ago |

They've hijacked scrolling. They've hijacked the spacebar. It flickers like crazy when I try to move through the article. Trying to get through it is an exercise in madness.

vcryan 4 days ago |

It really looks like they used Claude to design this webpage. I guess the color taupe it the marker of good AI today.

basilikum 3 days ago |

Why is microsoft.ai hosted on an ASN called WPEngine and not by Microsoft themselves?

kaicianflone 3 days ago |

Is that a pretext zoom effect when changing screen dimensions? Very cool.

euphetar 3 days ago |

Honestly, a lame release of mediocre models.

I was most excited about the "frontier tuning." Like, it will actually watch you do stuff and learn to do it for you? That would be actually interesting.

But no, it's just a data labelling interface: https://learn.microsoft.com/en-us/microsoft-365/copilot/copi.... You have to provide the instruction and give feedback and there is a whole UI with hour-lonf wait between steps. So basically they want you to do the labelling to train a model, or at least that's how it looks from the outside

Also the mission statement of Humanist AI is the most boring, but tries to sound way too grand. Like "all the cool labs have a mission statement, so we should also have one" vibes

gigatexal 3 days ago |

Anyone believing those benchmark numbers from a 35B model?

simjnd 4 days ago |

Absolutely disgusting scroll jacking, even when "Accessibility mode" is turned on

undefined 4 days ago |

undefined

throwawayffffas 3 days ago |

Meh, 1T parameters no weights? I am running a better model right now on 40GB of VRAM.

andai 3 days ago |

[dead]