AISec [x_feed]


Channel's geo and language: not specified, not specified
Category: not specified


News, papers and research about artificial intelligence security from X and other resources for you.
Pwn AI [channel] - https://t.me/pwnai

Related channels  |  Similar channels

Channel's geo and language
not specified, not specified
Category
not specified
Statistics
Posts filter






@giskard_ai: Grégory Herbé has joined Giskard as a Talent Acquisition Lead 🥳 Beyond his 15+ years in the recruitment world, there's a bit more to Greg's story. He is a proud father to three amazing boys and has called Luxembourg home since 2015, after spending a decade in Paris, coming… https://t.co/BrtqE4PBkK
https://twitter.com/giskard_ai/status/1760213779713814712


@wunderwuzzi23: A couple of weeks ago I shared some tricks on how to run custom Python code and explore Bard's "Code Interpreter" environment. If you use Gemini Advanced these tricks are not needed anymore! As of today it's super easy to run custom Python code, even has an edit button for… https://t.co/sTOoqw7bUf
https://twitter.com/wunderwuzzi23/status/1760191541862359183


The AI Security Pyramid of Pain

We introduce the AI Security Pyramid of Pain, a framework that adapts the cybersecurity Pyramid of Pain to categorize and prioritize AI-specific threats. This framework provides a structured approach to understanding and addressing various levels of AI threats. Starting at the base, the pyramid emphasizes Data Integrity, which is essential for the accuracy and reliability of datasets and AI models, including their weights and parameters. Ensuring data integrity is crucial, as it underpins the effectiveness of all AI-driven decisions and operations. The next level, AI System Performance, focuses on MLOps-driven metrics such as model drift, accuracy, and false positive rates. These metrics are crucial for detecting potential security breaches, allowing for early intervention and maintenance of AI system integrity. Advancing further, the pyramid addresses the threat posed by Adversarial Tools, identifying and neutralizing tools used by adversaries to target AI systems. This layer is key to staying ahead of evolving attack methodologies. At the Adversarial Input layer, the framework addresses the detection and mitigation of inputs designed to deceive or exploit AI models. This includes techniques like adversarial patterns and prompt injection attacks, which are increasingly used in sophisticated attacks on AI systems. Data Provenance is the next critical layer, ensuring the authenticity and lineage of data and models. This layer is pivotal in preventing the use of compromised or biased data in AI systems. At the apex is the tactics, techniques, and procedures (TTPs) layer, dealing with the most complex and challenging aspects of AI security. This involves a deep understanding and strategic approach to counter advanced AI-targeted attacks, requiring comprehensive knowledge and planning.

https://arxiv.org/abs/2402.11082








@mlsecops: 🎙️Preview the upcoming episode of The #MLSecOps Podcast! Cybersecurity expert, Sandy Dunn, joins us to cover hot topics like #genAI and strategies for securing AI-powered tech. Be notified when the episode airs! ➡️https://t.co/GsyqCMaMSF #aisecurity #ProtectAI https://t.co/Q7Xve4cR0A
https://twitter.com/mlsecops/status/1760058665422180696


@garak_llm: you could pay for jailbreak prompts - https://t.co/f9qdSMBDoA… - or you could just use garak's DanInTheWild probe to run a set of 666 known good jailbreaks against your LLM, and see if the model blocks them. 395 of these jailbreaks aren't mitigated by gpt-3.5-turbo 😬 https://t.co/PhjoW4oTsv
https://twitter.com/garak_llm/status/1760011930348159487






Your subscriptions:
1. Mikolaj Kowalczyk (@m1k0ww) / Twitterhttps://rss.app/feeds/zrUrrwOgvdBMkJfR.xml
2. Protect AI (@ProtectAICorp) / Twitterhttps://rss.app/feeds/as8Eb9OnBJMq4pBv.xml
3. LassoSecurity (@LassoSecurity) / Twitterhttps://rss.app/feeds/sNDsCHYnfjFLHZOX.xml
4. Adversa AI (@Adversa_AI) / Twitterhttps://rss.app/feeds/RzoiedsVrKZGSeS0.xml
5. LLM Security (@llm_sec) / Twitterhttps://rss.app/feeds/0M3G0rmTDHlXspe3.xml
6. HiddenLayer (@hiddenlayersec) / Twitterhttps://rss.app/feeds/vGtJZ7OqJK6cqPOd.xml
7. CalypsoAI (@calypsoai) / Twitterhttps://rss.app/feeds/y3AzlonBxETwBpJR.xml
8. AI Vulnerability Database (@AvidMldb) / Twitterhttps://rss.app/feeds/OkNUXzuJBtmsikP1.xml
9. OWASP Top 10 For LLM (@LLM_Top10) / Twitterhttps://rss.app/feeds/aJQoPRdbf6v2GCws.xml
10. Johann Rehberger (@wunderwuzzi23) / Twitterhttps://rss.app/feeds/VEj8S78t3A1vc5UY.xml
11. Prompt Security (@prompt_security) / Twitterhttps://rss.app/feeds/gcYspEDLcUszu2OF.xml
12. Nightfall (@NightfallAI) / Twitterhttps://rss.app/feeds/O5f1HoxSiMZv7Dtt.xml
13. Private AI 🥸 (@_PrivateAI) / Twitterhttps://rss.app/feeds/BGgKVRxz1lAjtCv4.xml
14. MLSecOps (@mlsecops) / Twitterhttps://rss.app/feeds/ZcDp2aj2giqRIwd9.xml
15. Cranium.ai (@CraniumAi) / Twitterhttps://rss.app/feeds/2ckGM6GYOnNTGs4P.xml
16. Lakera (@LakeraAI) / Twitterhttps://rss.app/feeds/1w2ADoWEKz1DjHa4.xml
17. Robust Intelligence (@robusthq) / Twitterhttps://rss.app/feeds/PNLDNXRlCaDcEL8N.xml
18. Dan McInerney (@DanHMcInerney) / Twitterhttps://rss.app/feeds/2gurWOLJRRbCl0E0.xml
19. huntr (@huntr_ai) / Twitterhttps://rss.app/feeds/uxKuwrcPHt24AScA.xml
20. Giskard (@giskard_ai) / Twitterhttps://rss.app/feeds/tn1sDSEUpMdldYfe.xml
21. Kobalt Labs (@KobaltLabs) / Twitterhttps://rss.app/feeds/sLBqPaiwHsYJEmMo.xml
22. Machine Learning Security Laboratory (@mlsec_lab) / Twitterhttps://rss.app/feeds/KKWlbjMWEMpM2dyu.xml
23. garak: LLM vulnerability scanner (@garak_llm) / Twitterhttps://rss.app/feeds/5MvsgIV0GwRJbNWk.xml
24. AI Safety Events Tracker (@AISafetyEvents) / Twitterhttps://rss.app/feeds/srnLiyJHQuLnBCNo.xml


Возможно, кто-то хочет поделиться дополнительными источниками, которые можно добавить в ленту. Если это так - то напишите в P.M @wearetyomsmnv
_____________________________________________________________________________________

Perhaps someone would like to share additional sources to add to the feed. If so - then write to P.M @wearetyomsmnv

Какие twitter account сейчас отслеживаются/
Which twitter account is being monitored right now:








@giskard_ai: First article in Journal du Net! 🤯 Our CEO, Alex Combessie, discusses how we assist our French customers with quality assurance for their AI models and our plans to scale globally soon! Key takeaways from this interview: - The motivations behind Giskard's creation
https://twitter.com/giskard_ai/status/1759866017449226367





20 last posts shown.

55

subscribers
Channel statistics