“Benja” — Fediverse search results on home.social

People @[email protected] · 2026-05-18 · 23:33 UTC

https://www.europesays.com/people/76583/ Netanyahu announces projects for Jerusalem, including Western Wall – Israel & Jewish News #BenjaminNetanyahu

#benjaminnetanyahu

Iwpost @[email protected] · 2026-05-18 · 23:25 UTC

Bennett and Eisenkot Lead Netanyahu in Israeli Prime Minister Suitability Poll. #BenjaminNetanyahu #GadiEisenkot #NaftaliBennett #together #YairLapid
https://iwpost.com/bennett-and-eisenkot-lead-netanyahu-in-israeli-prime-minister-suitability-poll/?fsp_sid=8540

#benjaminnetanyahu #gadieisenkot #naftalibennett #together #yairlapid

People @[email protected] · 2026-05-18 · 22:26 UTC

https://www.europesays.com/people/76515/ Netanyahu says Israeli army nearing completion of Gaza mission, signals readiness for all Iran scenarios – Middle East Monitor #BenjaminNetanyahu

#benjaminnetanyahu

People @[email protected] · 2026-05-18 · 20:10 UTC

https://www.europesays.com/people/76364/ As Netanyahu spotlights Israel’s ties to the UAE, its rulers prefer to be discreet #BenjaminNetanyahu

#benjaminnetanyahu

People @[email protected] · 2026-05-18 · 19:02 UTC

https://www.europesays.com/people/76285/ Bennett, Eisenkot meet as anti-Netanyahu bloc weighs next steps #BenjaminNetanyahu

#benjaminnetanyahu

Bytes Europe @[email protected] · 2026-05-18 · 19:02 UTC

Benjamin Netanyahu’s War at Home https://www.byteseu.com/2032345/ #Conflicts #Israel #israelis #PrimeMinisterBenjaminNetanyahu #War

#war #primeministerbenjaminnetanyahu #israelis #israel #conflicts

The New Yorker [Unofficial] @[email protected] · 2026-05-18 · 18:51 UTC

Benjamin Netanyahu’s War at Home

https://fed.brid.gy/r/https://www.newyorker.com/news/the-lede/benjamin-netanyahus-war-at-home

#war #israelis #israel #primeministerbenjaminnetanyahu #newsthelede

People @[email protected] · 2026-05-18 · 16:38 UTC

https://www.europesays.com/people/76132/ Israel’s diminished standing in the court of public opinion #BenjaminNetanyahu

#benjaminnetanyahu

People @[email protected] · 2026-05-18 · 15:27 UTC

https://www.europesays.com/people/76058/ Rebranding US aid to Israel #BenjaminNetanyahu

#benjaminnetanyahu

People @[email protected] · 2026-05-18 · 15:26 UTC

https://www.europesays.com/people/76056/ Israeli court postpones Netanyahu corruption hearing for ‘security’ reasons – Middle East Monitor #BenjaminNetanyahu

#benjaminnetanyahu

Petra van Cronenburg @[email protected] · 2026-05-18 · 14:37 UTC

@BenjaminHCCarr I can't imagine that this could be legal outside of the USA.
And for science, it should be a question of ethics even there.

#ethics #Science #academicChatter #education #school

#ethics #science #academicchatter #education #school

People @[email protected] · 2026-05-18 · 13:01 UTC

https://www.europesays.com/people/75904/ Netanyahu and the US paradox: The alliance held, the consensus broke #BenjaminNetanyahu

#benjaminnetanyahu

Benjamin Han @[email protected] · 2026-05-18 · 00:26 UTC

This weekend, in addition to my #99 #Parkrun (https://sigmoid.social/@BenjaminHan/116585831426535983), I also did a second run in the afternoon on the trails: a solid 10K with a bit of hail raining down on me at some point!

#Running #Trailrunning #Photo #PNW

#trailrunning #parkrun #running #photo #pnw

50+ Music @[email protected] · 2026-05-17 · 23:13 UTC

"The Night Has a Thousand Eyes" is a song written by #BenjaminWeisman, Dorothy Wayne, and Marilyn Garrett. It became a popular hit in 1962 for #BobbyVee and has had several cover versions over the years.
https://www.youtube.com/watch?v=LJfpDaFOkA0

#benjaminweisman #bobbyvee

Benjamin Han @[email protected] · 2026-05-17 · 18:22 UTC

@pbloem That's a good question! I wrote up a longer answer to your question at https://benjaminhan.net/posts/20260517-self-correction-after-reasoning-models/?utm_source=mastodon&utm_medium=social

The short version: yes, recent reasoning-model training *internalizes* what used to be inference-time external signals. The question is whether we can do it universally.

#LLMs #Reasoning #Metacognition

#llms #reasoning #metacognition

Bytes Europe @[email protected] · 2026-05-17 · 08:46 UTC

Benjamin Netanyahu sold Israel’s security for personal deals with Donald Trump, Avigdor Liberman https://www.byteseu.com/2028362/ #AvigdorLiberman #BenjaminNetanyahu #DonaldTrump #HarediDraft #Israel #IsraelElections #NetanyahuTrial

#netanyahutrial #israelelections #israel #haredidraft #donaldtrump #benjaminnetanyahu

Benjamin Han @[email protected] · 2026-05-17 · 05:17 UTC

Reflexion splits self-correction in two: an Evaluator that detects success/failure, and a Self-Reflection model that diagnoses what went wrong. The Evaluator's external signal — heuristic, exact-match, or test execution — gates whether diagnosis fires. When that signal misfires, as on MBPP Python's high false-negative rate, Self-Reflection rewrites correct code wrong, exactly the failure mode Cannot-Self-Correct documented.

https://benjaminhan.net/posts/20260516-reflexion/?utm_source=mastodon&utm_medium=social

#LLMs #AI #Reasoning #Agents #Metacognition

#llms #ai #reasoning #agents #metacognition

Benjamin Han @[email protected] · 2026-05-17 · 05:17 UTC

Cannot-Self-Correct tests the strong claim that LLMs can revise their own reasoning answers without any external signal about correctness. Across three benchmarks (GSM8K, CommonSenseQA, HotPotQA), the answer is no: the model's confidence carries over from the initial answer into the revision, and the self-correction loop tends to degrade rather than improve performance. The result refutes the class of approach Self-Refine belongs to.

https://benjaminhan.net/posts/20260516-cannot-self-correct/?utm_source=mastodon&utm_medium=social

#LLMs #AI #Reasoning #Metacognition

#llms #ai #reasoning #metacognition

Benjamin Han @[email protected] · 2026-05-17 · 05:16 UTC

In Self-Refine, a single frozen LLM acts as generator, critic, and rewriter in a prompt-only loop, and the paper reports about 20 points of average lift across seven tasks without any training, RL, or external signal. The gains vary widely by task: small on math reasoning, but large on dialogue and constrained generation, where what counts as "good" is hardest to define from a one-line critique.

https://benjaminhan.net/posts/20260516-self-refine/?utm_source=mastodon&utm_medium=social

#SelfRefine #LLMs #AI #Reasoning #Metacognition

#selfrefine #llms #ai #reasoning #metacognition

Benjamin Han @[email protected] · 2026-05-17 · 02:18 UTC

Anthropic launched The Anthropic Institute — a four-pillar research agenda introducing a third governance document type at frontier labs alongside declared values and deployment gates, set up to produce empirical findings the other layers can be checked against. OpenAI's recent "Adaptability" principle commits to updating positions as evidence comes in; TAI is built for that.

https://benjaminhan.net/posts/20260516-anthropic-institute-agenda/?utm_source=mastodon&utm_medium=social

#AI #Anthropic #Policy #Society #Economics #Ethics