home.social

#claude4 — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #claude4, aggregated by home.social.

  1. AI가 전문가 업무 40% 대체? 헤드라인이 놓친 결정적 사실

    GPT-5가 전문가 업무의 40%를 수행한다는 벤치마크 결과, 하지만 그 이면에 숨겨진 인간의 역할과 AI 시대 새로운 업무 방식인 할당 경제를 알아봅니다.

    aisparkup.com/posts/5472

  2. #GLM45 frontier #AI #LLM with 355B parameters ranks 3rd against #OpenAI #Claude4 #Gemini across benchmarks 🤖

    🧠 Two variants: GLM-4.5 (355B total/32B active parameters) and GLM-4.5-Air (106B total/12B active)
    🔄 Hybrid reasoning models with thinking mode for complex tasks and non-thinking mode for instant responses
    🏆 Ranks 3rd overall against #OpenAI, #Anthropic, #Google #DeepMind models across 12 benchmarks

    🧵 👇

  3. #Claude4 #Sonnet and I just created this new #AIAgent (#AgenticAI) app called Context Dictionary for our #ICandy #browser #dashboard. A user can upload a text or pdf file to read in this app. To check the definition and contextual meaning of a word, the user only needs to left click the mouse. A bubble will appear to give the user the contextual meaning of the word. We are using #IBM #granite 3.3 8B on my local LM Studio server as the AI agent.

    #LLM #AI #dictionary #English

  4. What is an #AIAgent or #AgenticAI whatever you want to call it? #Claude4 #Sonnet wrote an AI Agent yesterday that takes an English word to create a line with #English explanation and a sample sentence that uses the word.

    I used #IBM Granite 3.3 8B (4.1 G bytes) small #LLM to process 3000 English words to produce over 500M bytes of data overnight while I was sleeping. I can now use this mega data set to feed the #GRE #Flashcard module of my #ICandy #browser #dashboard.

    #LLMs #AI

  5. #Claude4 #Sonnet and I just added a new #Latex module to our ICandy #browser dashboard. Latex is the best typsetting tool for #mathematicians, #economists, #engineers, ...This module is not fully functional yet. But we will be adding mathjax and pdftex to it some time today.

    #latex #math #LLM #AI

  6. #Claude4 #Sonnet just created a pdf editor inside our #ICandy #browser dashboard. Now, I can double click a #pdf #Ebook in my #ELibrary module in ICandy. It will be displayed inside ICandy for me to read, highlight, and add note annotation to the ebook. #AI #LLM #LLMs

    It is not easy to work with LLMs or other AIs. However, it is not difficult either. You only need to know some basic computer operations and the theory behind the current #transformer #CNN LLM models.

  7. Can I fully replace #Windows 11's desktop using my #ICandy #bowser #dashboard created by #Claude4 #Sonnet and myself?

    The problem we encounter is the so-called browser #security that prevents us from accessing local directories and files easily. This is obviously part of #Microsoft's #OS #monopoly strategy.

    In just a week, Claude4 wrote 13 apps in ICandy with 2 of them using #AgenticAIs using the small #LLM 's from #LMstudio backed up by those from #Ollama. #LLMs #AI

  8. What if #calendar marries #to-do-list and they have a baby? This is exactly the To-do-list calendar module that #Claude4 #Sonnet and I created for our #ICandy dashboard browser project. We created a total of 3 new browser apps today. #AI #LLM

  9. #Claude4 #Sonnet and I created 8 #browser apps and 1 #AgenticAI in my #ICandy dashboard browser app in just 5 days. Our latest creations include a #Youtube to MP3 grabber that automatically downloads #videos and convert them into #mp3 files. These mp3 files can then be played in our newly created #Music Player module.

    The power of AI is limitless. It is a matter of your #computer literacy, your AI literacy, and your imagination.

  10. #Claude4 #Sonnet and I just created this new music player module for our #ICandy #dashboard #browser project. My goal is to replace the desktop and move most of my activities onto the browser. Browser is the future of computing. You can now run #Webassembly and #WebGPU on a browser. In the future all #videogames can be run in a browser. You can have a dedicated #AI or a team of #AgenticAI 's working on the browser working for you locally and on the #internet. #gaming #LLM

  11. I added two modules (Weather Forecast and Digital Clock) to my #ICandy #dashboard #browser #app today with the help of #Claude4 #Sonnet. As the project gets bigger, it is more difficult to work with Sonnet because of message size, conversation size and quota limits. But it is ok. I was chatting with #Microsoft #Copilot during my down time. Now I learned more about how #VoiceVox, #CORS, #MCP, and local #HTTP work. #AI #AIs are the best invention ever for people who are willing to learn.

  12. I can finally play #chess with #stockfish on this #ICandy #Dashboard that I created using #Claude4 #Sonnet. What would have taken a month to develop only took me 2 days. Who says #AIs are bad? AIs bring freedom to humans. We don't need to be controlled by the big corporations in the Silicon Valley. I am only paying $200 for Claude pro. It took me less than a week to create this Candy Dashboard. I will keep adding features like mail check, #mastodon, a clock, ... to it. Goodbye #desktop. #AI

  13. Can #Claude4 #Sonnet finish building the #Stockfish #AI #chess #engine app for my #ICandy #browser widget panel app in one day? We have been working very hard all afternoon. It just ran out of quota and I need to wait 5 hours for it to resume its hard work again. I will be playing with my dog and then do some reading on x86 Assembly prepared by Sonnet. Then I will work with Sonnet again. #Computers make some people smarter, but some dumber, so do #AIs. So, don't blame #AIs.

  14. How many apps can I create with the help of #Claude4 #Sonnet in one day in my #ICandy #Browser #app? We created 3 apps and 1 #AgenticAI. 3 of them were based on 3 non-browser apps we created and Sonnet created 1 additional #notepad app that can use #Markdown. ICandy is based on #IGoogle 's idea. That is, you can run everything in a browser without relying on any #desktop apps. The browser AI agent can generate Japanese stories and use #VoiceVox #TTS to read the stories in Japanese.

  15. 🚨 60,000 tokens. One AI. Zero transparency.
    Claude isn’t just answering—you’re hearing its hidden leash talk.
    This exposé uncovers the secret 'constitution' shaping every word it says.
    🧠 System prompts = modern censorship.
    💬 What else are your AIs hiding?

    👇 Read the full piece:
    medium.com/@rogt.x1997/claudes
    #AIethics #Claude4 #PromptEngineering #TechGovernance
    medium.com/@rogt.x1997/claudes

  16. 🚀 Claude 4 didn’t just assist—it outperformed.
    In a 7-hour live dev session, it refactored legacy Java with zero hallucinations, full memory, and enterprise-grade precision.

    🔍 We compared Claude 4 vs ChatGPT across 5 key metrics — and the results will surprise you.

    📖 Read the full breakdown:
    👉 medium.com/@rogt.x1997/claude-

    📌 #Claude4 #LLMbenchmark #AIengineering #Anthropic
    medium.com/@rogt.x1997/claude-

  17. "Anthropic publish most of the system prompts for their chat models as part of their release notes. They recently shared the new prompts for both Claude Opus 4 and Claude Sonnet 4. I enjoyed digging through the prompts, since they act as a sort of unofficial manual for how best to use these tools. Here are my highlights, including a dive into the leaked tool prompts that Anthropic didn’t publish themselves.

    Reading these system prompts reminds me of the thing where any warning sign in the real world hints at somebody having done something extremely stupid in the past. A system prompt can often be interpreted as a detailed list of all of the things the model used to do before it was told not to do them.

    I’ve written a bunch about Claude 4 already. Previously: Live blogging the release, details you may have missed and extensive notes on the Claude 4 system card.

    Throughout this piece any sections in bold represent my own editorial emphasis."

    simonwillison.net/2025/May/25/

    #AI #GenerativeAI #Claude #Claude4 #Anthropic #SystemPrompts #PromptEngineering #LLMs #Chatbots

  18. Anthropic's new Claude 4 models, Opus & Sonnet, are here. Opus aced Pokémon for 24 hrs, showing wild long-term memory. Huge for AI agents staying on track through complex, lengthy tasks. #AI #Claude4 #AgentAI

  19. 🧠 Another flagship model released! #Anthropic just unveiled Claude Opus 4 and Claude Sonnet 4, and they are at the top of the leaderboard for coding 💻

    📰 Check out the announcement: anthropic.com/news/claude-4

    #AI #GenAI #LLMs #Claude #Claude4 #SweBench