有狐臭是什么原因| 透析是什么意思啊| 高密度脂蛋白胆固醇偏低是什么原因| 4月29号是什么星座| 淘宝预售是什么意思| 支付宝提现是什么意思| 月经量多是什么原因导致的| 木加一笔有什么字| 康养中心是做什么的| 县长是什么级别的干部| 胶原蛋白什么时候喝最好| 孕妇吃黑芝麻对胎儿有什么好处| 毕婚族是什么意思| 栀子泡水喝有什么好处| 喝水不排尿是什么原因| 怀孕拉肚子吃什么药| 为什么会胀气| 身上长水泡是什么原因| 发烧嗓子疼吃什么药好| 7.23什么星座| 外阴起红点是什么病| 胃不舒服吃什么| 心绞痛用什么药最好| 脾胃虚寒吃什么水果好| 感冒喝什么茶| 哕是什么意思| 右肾盂分离是什么意思| 女生做彩超是检查什么| 唱腔是什么意思| 四十年是什么婚| 检查头部应该挂什么科| 巨蟹男和什么座最配对| 手上脱皮什么原因| 手作是什么意思| 三月二十六是什么星座| 中成药是什么药| 矫正是什么意思| 鸾凤和鸣什么意思| 吃什么补内膜最快| 喝酒后肚子疼什么原因| 64是什么| hrs是什么意思| 鸡与什么生肖相合| 胎心快是什么原因| 老年人补什么钙效果最好| 一什么明月| 什么鱼| 稀料是什么| 感染幽门螺旋杆菌吃什么药| 错落有致的意思是什么| 代可可脂是什么| 孕妇什么体质容易晚生| 发烧喝什么饮料比较好| 南方有什么水果| 娇妻是什么意思| 什么是扁平疣图片| 种小麦用什么肥料好| ch4是什么气体| 枫叶是什么颜色的| 大黄和芒硝混合外敷有什么作用| 既济是什么意思| 腹胀是什么原因引起的| 马栗是什么植物| 什么草药能治肿瘤| 77年什么命| 昙花一现是什么意思| 腋下出汗是什么原因| fossil是什么意思| 凌晨两点半是什么时辰| 鱼是什么意思| 厥阴病是什么意思| 叔叔的儿子叫什么| 低血钾吃什么药| 梦见离家出走是什么意思| 什么叫单亲家庭| 廉价什么意思| 熊猫属于什么科| sg比重是什么意思| 什么可以吃| 什么是生僻字| cho是什么意思| 性激素检查是查什么| 为什么会拉水| 幺妹是什么意思| 射手座是什么象星座| 蜻蜓是什么生肖| 血稠吃什么食物好得快| 吃什么治疗阳痿| pw是什么意思| 珊瑚色是什么颜色| 孕早期是什么时候| logo是什么| 爱新觉罗是什么民族| 蕴是什么意思| 糖耐量异常是什么意思| 妹妹是什么意思| hpv阳性是什么病| 篱笆是什么东西| 全麻手术后为什么不能睡觉| 四时感冒什么意思| rca是什么意思| 草朋刀是什么字| 来大姨妈血块多是什么原因| 什么是纸片人| 五脏六腑指的是什么| 牡丹什么时候开花| 啊哈是什么意思| 重阳节又称什么节| 闭经和绝经有什么区别| 冬虫夏草有什么功效与作用| 乌龟用什么呼吸| 什么人不能吃韭菜| 倒牙是什么意思| 26岁属什么生肖| 蛇缠腰是什么病| 4个火读什么| 预防脑出血吃什么药| 什么材质可以放微波炉加热| 吃丝瓜有什么功效和作用| 翻什么覆什么| 什么茶有助于睡眠| 副研究员什么级别| dr是什么检查项目| 血象高是什么原因| 中图分类号是什么| 宫颈hsil是什么意思| 沉甸甸的爱是什么意思| 不过如此是什么意思| 保底工资是什么意思| 文号是什么| 361是什么意思| 鹦鹉为什么会说话| 什么食物补血效果最好最快| 红薯什么季节成熟| 烧腊是什么| 赢字五行属什么| 能说会道是什么生肖| 六小龄童的真名叫什么| 为什么同房会痛| 梦见好多衣服是什么意思| 把握时机是指什么生肖| 水瓶男和什么星座最配| 伊索寓言有什么故事| 梦见搬家是什么预兆| 为什么挠脚心会痒| 回族为什么不吃猪肉| 腺肌症是什么原因引起的| 李时珍的皮是什么意思| 心脾两虚吃什么药| 吃什么补骨髓造血| 疱疹性咽峡炎吃什么药最管用| 什么什么大地| 什么男什么女的成语| 96100是什么电话| 燃气泄露是什么味道| 暂住证和居住证有什么区别| 数位板是什么| 女性血热吃什么好得快| 白夜是什么意思| 大腿痛挂什么科| 四面弹是什么面料| 眼压高有什么症状| 喉咙细菌感染吃什么药| 蜘蛛痣是什么样的| 年糕是什么做的| 碱性磷酸酶低是什么原因| 薄荷泡水喝有什么好处| 风团是什么| 轻度抑郁有什么症状| 英纳格手表什么档次| 生辰八字指什么| 平行宇宙是什么意思| 阴道是什么味道| 月痨病是什么病| 工科和理科有什么区别| 龟头瘙痒是什么原因| 什么的笑| 714什么星座| 后脑勺麻木是什么征兆| 宅是什么意思| 一个口一个麦念什么| 呼呼是什么意思| 红薯的别名叫什么| 怀孕建卡需要什么材料| 妈妈过生日送什么礼物好| 喝酒后手麻是什么原因| 睡眠障碍挂什么科| 左侧淋巴结肿大是什么原因| 查凝血酶能查出什么病| 白细胞低什么原因| 尼古丁是什么东西| 绣球花什么时候开花| 吴亦凡属什么生肖| 二拇指比大拇指长代表什么| 徐才厚什么级别| 肠梗阻是什么症状| 大葱什么时候播种| 脖子下面的骨头叫什么| 什么人不能种生基| 斐然是什么意思| 有趣的什么填空| 婴儿头发长得慢是什么原因| 老爹鞋适合什么人穿| 百香果有什么好处功效| 丙烯颜料用什么洗掉| 人心隔肚皮什么意思| 竹棉和纯棉有什么区别| 扁平苔藓是什么病| 下嘴唇发麻什么病兆| 建档需要准备什么资料| 7.30是什么星座| 疑似是什么意思| 脱水有什么症状| 眉尾上方有痣代表什么| 什么是窦性心律不齐| 看看我有什么| 饶舌是什么意思| 脚上真菌感染用什么药| 瘥是什么意思| pass是什么意思| 1993属什么生肖| 孟字五行属什么| 细菌性前列腺炎有什么症状| 藿香是什么| 生粉是什么| 酌情处理是什么意思| 吃维c有什么好处| 温州特产是什么| 1120是什么星座| plump什么意思| 麦粒肿涂什么药膏| 小孩内热吃什么药| 异什么意思| 芮字五行属什么| 游丝是什么意思| 蛇盘疮吃什么药| 白天嗜睡是什么原因| 犹豫不决是什么生肖| 过年吃什么| 曼字五行属什么| 什么叫生僻字| 中山大学是什么级别| 鸭子什么时候下蛋| 烂脚丫用什么药| 恋爱脑是什么意思| 青汁是什么| 枫树叶子像什么| 脚腕筋疼是什么原因| 尿胆原阳性是什么病| 阿尔山在内蒙古什么地方| 海马是什么动物| 粉瘤挂什么科| 什么鱼做酸菜鱼最好吃| 西洋参补什么| 什么文什么字| 12月9号是什么星座| 为什么做完爱下面会疼| 十月一日是什么日子| 为什么一热就头疼| 悬雍垂发炎吃什么药| 为什么会磨牙| 女s是什么| 龟头敏感用什么药| 当家做主是什么生肖| 百度
Skip to Content
0%

脂肪肝吃什么食物

AI-powered solutions like Salesforce CRM are revolutionizing customer engagement, streamlining workflows, and providing deeper insights into customer needs. However, with the rise of large language models (LLMs), new security challenges have emerged. One significant threat is prompt injection attacks, which attempt to manipulate AI systems through carefully crafted inputs. As Salesforce integrates AI into its CRM tools, understanding and protecting against these vulnerabilities is essential for safeguarding data, reputation, and customers.

Failing to address emerging threats, such as prompt injection, could result in data breaches, compromised system integrity, and erosion of customer trust. It is crucial for organizations to proactively implement robust security measures. This blog details the AI Research team’s work on developing and implementing reliable solutions to protect Salesforce applications against prompt injection attacks. Our goal is to ensure the ongoing safety and effectiveness of our AI-enhanced CRM tools.

What is Prompt Injection?

In AI systems, a “prompt” refers to instructions given to an AI application in order to perform a specific task. The LLMs powering Salesforce’s AI applications use prompts and other inputs provided by our users to generate responses. The system returns these responses to the user. The generative nature of LLMs makes them susceptible to carefully crafted prompt engineering attacks. A prompt injection attack refers to a malicious prompt designed to elicit unintended information or fraudulent actions from an LLM. Prompt injection attacks exploit an LLM’s instruction following ability and may trick them into bypassing security policies, disclosing sensitive data, or producing harmful content. Recently, Copilot for Microsoft 365 was shown to be vulnerable to prompt injection attempts. Similarly, bad actors can design prompts with malicious intent that may seek to exploit Salesforce’s AI applications for similarly nefarious purposes.

At Salesforce, trust is our #1 value. We design AI applications with trust at their core, that our customers can safely use. The Salesforce AI Research team builds models and detectors to identify prompts that may be adversarial in nature. With the advent of agentic workflows, and LLMs having access to a plethora of tools, datasets etc., detecting and deflecting prompt injection attempts is of vital importance.

Safeguarding Salesforce AI Against Prompt Injection

In order to safeguard Salesforce and customer assets from prompt injection attempts, we explored different research paths. One possible intervention is to develop a system capable of analyzing user prompts and assessing their safety. To this end, the AI research team develops classifiers and heuristic methods. These methods identify malicious intent in prompts with high accuracy. The following section outlines steps taken to design, build, and evaluate such a system.

Design: Creating a Taxonomy

Before we could begin training a reliable prompt injection detection model, we had to design its taxonomy. A thoughtful taxonomy is essential for any machine learning classifier. Developing models to detect prompt injection attempts is an iterative process, and a well-structured taxonomy allows us to reliably evaluate (and improve) the performance on specific inputs. The table below showcases the seven prompt injection variants that are relevant to the CRM threat model.

Type      Description
Pretending/ Role-playInstructing the LLM/agent to assume the role of a different “system persona” with malicious intent. Social engineering attacks such as deceiving the system with adversarial conversational content
Privilege Escalation/ Attempts to change core system rulesInjecting malicious instructions that aim to bypass/change existing system policies and the LLM safety training. E.g. Do Anything Now (DAN) jailbreak attacks
Prompt Leakage Prompts intending to leak sensitive information from the LLM prompt such as the system policies and contextual knowledge documents. This is for the purpose of active reconnaissance
Adversarial SuffixA set of seemingly random character encodings appended to a prompt. It is designed to circumvent guardrails and alignme
Privacy AttacksPrompts that attempt to extract, infer, or expose personal or confidential data. This is with the aim of unauthorized access or misus
Malicious Code GenerationPrompts attempting to generate malicious code outputs from an LLM. E.g. creating malware, viruses, fraud utilities etc.

With a taxonomy in hand, we were able to begin training our classifier, which is discussed in the next section. Developing this taxonomy is an iterative process performed by the AI research team in collaboration with Salesforce security, product and ethics teams. 

Build: Gathering Data

After carefully defining the above taxonomy, we procured high-quality data to train and benchmark our injection detector. It was important that we curated the data points which supported our proposed taxonomy. We use a mix of open source datasets published by the community on prompt injection scenarios and jailbreak attempts, along with other CRM-related prompts.

We worked cross-functionally with an internal annotation team as well as the Office of Ethical and Humane Use (OEHU) to ensure reliable, relevant, and labeled training data. OEHU continually provided expert assistance and clarification throughout the data labeling process. Simultaneously, the legal team helped us ensure that we only use permissible datasets for classifier training. This collaboration was crucial in aligning our model with Salesforce’s commitment to trust and safety.

Augmenting open-source datasets that have limited data samples for one or more target categories is crucial to training a dependable classifier. In addition to human annotation, we utilized synthetic data to bolster our training datasets. When faced with such categorical short-comings, we turned to our in-house synthetic data generation pipelines. The resultant pipelines leveraged techniques such as zero-shot and few-shot LLM prompting, LLM self-correction of labels, and LLM content editing to inject harmful content in safe texts (a.k.a., data “mutation”). The combination of synthetic data generation techniques, coupled with human annotation allowed us to create diverse training data that is well-balanced across different classes in taxonomy, has control over subtle differences between safe and unsafe content, and is tailored to various CRM use cases.

Evaluation: Implementing a Feedback Loop

Our iterative training process consisted of a feedback loop with four phases: training, testing, red teaming, and (re)evaluation. The goal was to cycle through these phases as often as needed to develop a model that met our performance expectations. 

After each round of training, we benchmark the model’s performance on a variety of test sets according to our taxonomy. Following initial testing, we red teamed our model checkpoints, simulating attacks and stress-testing the models by introducing challenging inputs. We utilized our internal automated red teaming library, fuzzai, to build our red teaming suite. 

The final phase, evaluation, combined results from testing and red teaming to analyze the collective outcomes. This analysis, particularly of the red teaming results, helped us identify potential weaknesses in our model, for improvement in the next round of our feedback loop.

We utilize this process to build multiple iterations of our prompt injection detection model, as well as other detectors deployed to Salesforce’s Trust Layer. The prompt injection model assigns probability scores to user prompts along with the labels. This allows an intervention before sending them to an Agent or LLM for execution.

Conclusion: Enhancing Security for a Safer AI-Powered CRM

Prompt injection attacks highlight the importance of ongoing security monitoring for AI-powered CRM systems. By leveraging Salesforce’s robust defense mechanisms and staying informed about emerging threats, you can help ensure that your CRM is protected against the evolving landscape of AI vulnerabilities. We continually evaluate our prompt injection detection classifier against open source detector, external LLMs, and other third-party solutions. Embrace AI with confidence—knowing that your Salesforce CRM defends against prompt injection and other security risks.

With these protections in place, Salesforce customers can continue to benefit from the powerful capabilities of AI while keeping sensitive information secure. 

Acknowledgments

  • Yixin Mao, Vera Vetter, Jason Wu

Explore more

  • Salesforce AI Website: www.salesforceairesearch.com
  • Follow us on Twitter: @SFResearch, @Salesforce

Get the latest articles in your inbox.

白带清洁度lll度是什么意思 脚底褪皮是什么原因 pci是什么意思 不为良相便为良医是什么意思 什么食物养胃又治胃病
daddy是什么意思 胃炎可以吃什么水果 局部癌变是什么意思 新生儿拉肚子是什么原因引起的 屏保什么意思
肝肿瘤不能吃什么 组织是什么意思 alt医学上是什么意思 感冒是挂什么科 身上长小肉揪是什么原因
handmade是什么牌子 口腔溃疡吃什么食物 肾功能不全有什么症状 人格是什么意思 吃完饭就想睡觉是什么原因
孕早期吃什么水果好hcv8jop4ns9r.cn 晚上睡觉尿多是什么原因hcv8jop9ns9r.cn 早上八点到九点属于什么时辰hcv9jop3ns3r.cn 轻断食什么意思hcv8jop0ns5r.cn 欧金金什么意思hcv7jop9ns5r.cn
脑供血不足什么原因hcv9jop1ns8r.cn 刘欢属什么生肖hcv7jop6ns6r.cn 天地不仁以万物为刍狗是什么意思hcv9jop1ns5r.cn 什么是高脂肪食物hcv9jop4ns6r.cn 回盲瓣呈唇形什么意思hcv9jop2ns0r.cn
头发没有光泽是什么原因hcv9jop7ns0r.cn 皮肤黑穿什么颜色的衣服显白hcv9jop2ns2r.cn 异常灌注是什么意思hcv8jop4ns7r.cn 上火吃什么最快能降火bjcbxg.com 破伤风伤口有什么症状hcv9jop7ns1r.cn
1991年属羊是什么命hcv8jop9ns8r.cn 带银饰有什么好处hcv9jop2ns0r.cn 咳嗽吐黄痰是什么原因hcv7jop4ns6r.cn 生源地是什么意思hcv8jop6ns2r.cn 芥酸对身体有什么危害hcv8jop5ns3r.cn
百度