Alibaba has claimed that its open-source large model Qwen2-Math delivers “state-of-the-art” math competency, saying it handles mathematical problems in algebra and geometry with 84% accuracy, and has outperformed OpenAI’s GPT 4o and Google’s Gemini 1.5 Pro. The math-centered model, which was trained on Alibaba’s foundation Qwen2 model by feeding it a large-scale high-quality math corpus, is currently only supported in English, with a bilingual version coming soon, according to the tech giant. The Alibaba team has challenged Qwen2-Math with multiple benchmarks including China’s latest GaoKao (college entrance exam) math problems and questions from US math competition AIME, which all proved the model’s proficiency in dealing with advanced mathematical problems. [QbitAI, in Chinese]
文章
94
浏览
14
获赞
51
Chase bank tried to be relatable on Twitter and got absolutely dunked on
Brands, may we remind you for the umpteenth time, that if you're trying to get #relatable on TwitterEven Facebook shareholders are sick of Mark Zuckerberg's excuses
Facebook's own shareholders are losing patience for Mark Zuckerberg's excuses. At the company's annuReview: Snap's new Spectacles are the boring update we needed
Snap "we're a camera company" Inc. just launched its second piece of hardware: the next version of iBar sparks internet outrage after horribly offensive Cinco de Mayo celebration
Though Cinco de Mayo is an important day of celebration and historical remembrance in Mexico, unfortGoogle Maps and YouTube Music just made some commutes a little better
Google Maps has featured music controls for Spotify, Apple Music, and Google Play since 2018, but itIn EU hearing, Mark Zuckerberg dodged lawmakers' tough questions
Mark Zuckerberg is back on his bullshit. The Facebook CEO appeared before Members of the European PaFacebook announces plans to build 'Clear History' tool to combat privacy concerns
Facebook's annual developer conference, F8, has historically been a celebration of the company's greEthereum hits $800 ahead of its big week
Ethereum, the second-largest cryptocurrency by market cap (behind Bitcoin), has a potentially tumultChase bank tried to be relatable on Twitter and got absolutely dunked on
Brands, may we remind you for the umpteenth time, that if you're trying to get #relatable on TwitterHackers exploit smart thermometer to steal casino information
Having a whole bunch of smart objects like lights, appliances, and thermometers can make life a littReport: Another personality quiz exposed Facebook data from millions
Welp, it looks like another quiz app may have exposed millions of Facebook users' personal data.A peOculus Go review: VR has never been so good for so cheap
I can’t think of a more messy and complicated technology in the last decade than VR.I’veCoronavirus is not the man now dog: YTMND is back, and just in time
The pandemic profoundly alters our sense of time. Quarantine grinds lives to a halt, injecting themCities strive for improvement after Amazon HQ2 rejection
As Amazon moves forward in its HQ2 selection process, it's acting like a really, really considerateWhatsApp is banning teens under 16 in Europe ahead of privacy law changes
For teenagers in the European Union, WhatsApp is about to change in a big way. The Facebook-owned me