#aijailbreak — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #aijailbreak, aggregated by home.social.
-
Fine-Tuning on My Own Commit History: The Model Now Writes Bugs in My Style
Because when you fine-tune on your own history, you are not training a model to be better than you.
-
Love watching engineers build a digital raccoon, act surprised when it goes through the trash, and then publish a whitepaper titled 'Discovering Emergent Dumpster Behavior' 🦝🤖📉 #AIJailbreak #TechInnovation
-
🤖🤪 Ah yes, the groundbreaking innovation of running AI in "YOLO mode" and logging its every sneaky move, because nothing says cutting-edge like letting your sandboxed bots try to jailbreak themselves on purpose. 🎉🌪️ Who would've thought that AI might actually...do what it's programmed to do? 🙄 #TechRevolutionFail
https://voratiq.com/blog/yolo-in-the-sandbox/ #TechInnovation #AIExperiment #SandboxAI #AIJailbreak #TechRevolution #HackerNews #ngated
-
Researchers Hack ChatGPT Memories and Web Search Features https://www.securityweek.com/researchers-hack-chatgpt-memories-and-web-search-features/ #ArtificialIntelligence #AIjailbreak #Featured #ChatGPT #AI
-
ChatGPT Tricked Into Solving CAPTCHAs https://www.securityweek.com/chatgpt-tricked-into-solving-captchas/ #ArtificialIntelligence #AIjailbreak #CAPTCHA #ChatGPT #AI
-
UAE’s K2 Think AI Jailbroken Through Its Own Transparency Features https://www.securityweek.com/uaes-k2-think-ai-jailbroken-through-its-own-transparency-features/ #ArtificialIntelligence #Uncategorized #AIjailbreak #jailbreak #Featured
-
Google Gemini Tricked Into Showing Phishing Message Hidden in Email https://www.securityweek.com/google-gemini-tricked-into-showing-phishing-message-hidden-in-email/ #ArtificialIntelligence #promptinjection #vulnerability #GoogleGemini #AIjailbreak
-
New AI Jailbreak Bypasses Guardrails With Ease https://www.securityweek.com/new-echo-chamber-jailbreak-bypasses-ai-guardrails-with-ease/ #ArtificialIntelligence #DataProtection #AIjailbreak #jailbreak #Featured #LLM #AI
-
All Major Gen-AI Models Vulnerable to ‘Policy Puppetry’ Prompt Injection Attack https://www.securityweek.com/all-major-gen-ai-models-vulnerable-to-policy-puppetry-prompt-injection-attack/ #ArtificialIntelligence #PromptEngineering #AIjailbreak #AI
-
New Jailbreak Technique Uses Fictional World to Manipulate AI – Source: www.securityweek.com https://ciso2ciso.com/new-jailbreak-technique-uses-fictional-world-to-manipulate-ai-source-www-securityweek-com/ #rssfeedpostgeneratorecho #ArtificialIntelligence #CyberSecurityNews #securityweekcom #ImmersiveWorld #securityweek #AIjailbreak #Jailbreak #AI
-
New Jailbreak Technique Uses Fictional World to Manipulate AI https://www.securityweek.com/new-jailbreak-technique-uses-fictional-world-to-manipulate-ai/ #ArtificialIntelligence #ImmersiveWorld #AIjailbreak #jailbreak #AI
-
New CCA Jailbreak Method Works Against Most AI Models – Source: www.securityweek.com https://ciso2ciso.com/new-cca-jailbreak-method-works-against-most-ai-models-source-www-securityweek-com/ #rssfeedpostgeneratorecho #ArtificialIntelligence #CyberSecurityNews #securityweekcom #GenerativeAI #securityweek #AIjailbreak #Jailbreak #AI
-
DeepSeek Compared to ChatGPT, Gemini in AI Jailbreak Test https://www.securityweek.com/deepseek-compared-to-chatgpt-gemini-in-ai-jailbreak-test/ #ArtificialIntelligence #AIjailbreak #jailbreak #DeepSeek #ChatGPT #Gemini #AI
-
DeepSeek Security: System Prompt Jailbreak, Details Emerge on Cyberattacks https://www.securityweek.com/deepseek-security-system-prompt-jailbreak-details-emerge-on-cyberattacks/ #ArtificialIntelligence #AIjailbreak #jailbreak #DeepSeek #DDoS
-
ChatGPT, DeepSeek Vulnerable to AI Jailbreaks https://www.securityweek.com/ai-jailbreaks-target-chatgpt-deepseek-alibaba-qwen/ #ArtificialIntelligence #AIjailbreak #jailbreak #DeepSeek #ChatGPT #Qwen #AI
-
ChatGPT Jailbreak: Researchers Bypass AI Safeguards Using Hexadecimal Encoding and Emojis https://www.securityweek.com/first-chatgpt-jailbreak-disclosed-via-mozillas-new-ai-bug-bounty-program/ #ArtificialIntelligence #AIjailbreak #Featured #ChatGPT #Mozilla #0Din #AI
-
‘Deceptive Delight’ Jailbreak Tricks Gen-AI by Embedding Unsafe Topics in Benign Narratives – Source: www.securityweek.com https://ciso2ciso.com/deceptive-delight-jailbreak-tricks-gen-ai-by-embedding-unsafe-topics-in-benign-narratives-source-www-securityweek-com/ #rssfeedpostgeneratorecho #ArtificialIntelligence #artificialinteligence #CyberSecurityNews #securityweekcom #GenerativeAI #securityweek #AIjailbreak #AI
-
‘Deceptive Delight’ Jailbreak Tricks Gen-AI by Embedding Unsafe Topics in Benign Narratives https://www.securityweek.com/deceptive-delight-jailbreak-tricks-gen-ai-by-embedding-unsafe-topics-in-benign-narratives/ #ArtificialIntelligence #artificialinteligence #generativeAI #AIjailbreak #AI
-
OpenAI threatens bans for probing new AI model’s “reasoning” process https://arstechnica.com/?p=2049959 #openaistrawberry #machinelearning #promptinjection #rileygoodside #simonwillison #aijailbreak #jailbreaks #o1-preview #strawberry #jailbreak #openaio1 #chatgpt #chatgtp #o1-mini #biz #gpt-4o #openai #gpt-3 #gpt-4 #hacks #ai #o1
-
Japanese Man Arrested for GenAI Ransomware as AI Jailbreak Concerns Grow https://thecyberexpress.com/genai-ransomware-arrest-ai-jailbreak/ #TheCyberEpressNews #CybersecurityNews #AICybersecurity #TheCyberExpress #RansomwareNews #FirewallDaily #AIJailbreak #LLMsecurity #jailbreak #ChatGPT