small language models
Small language models are trained on a small but very specific, high-quality dataset geared toward a single task. These mini models pack a huge punch, for much less. [Creatives on Call]

Small models are typically deployed for a single specific task. They're far less expensive, more efficient, higher performing and, often, more accurate than LLMs.

When it comes to generative AI and the models that drive it, sometimes less is more. Many businesses find that small language models tailored to very specific tasks can be more effective and efficient than large language models (LLMs). Small models are less expensive to train and maintain, and often outperform their gigantic multipurpose counterparts on the tasks they are built for.

Here we’ll explain the appeal of small models, how they work, and how they can benefit your business.

What is a small language model?

A small language model is a machine-learning algorithm that’s been trained on a dataset much smaller, more specific, and, often, of higher quality than an LLM’s. It has far fewer parameters (the values the model learns from data during training) and a simpler architecture. Like LLMs, the advanced AI systems trained on vast amounts of data, small language models can understand and generate human-sounding text.

Small models are typically deployed for a single specific task (such as answering customer questions about a certain product, summarizing sales calls, or drafting marketing emails) and can be more computationally efficient and faster than LLMs due to their small size and higher-quality, more targeted data. This means you can save money and time, while also improving accuracy by designing topic-specific small language models into your architecture. 

Small language models are not designed, for example, to help you research trends in the healthcare industry. They can, however, help a healthcare company answer customer questions about, say, a new health program for diabetes prevention.

Cost, relevance, and complexity are three important ways small models differ from LLMs.

Why are large language models so expensive?

An LLM is a type of AI model that can generate human-like responses by processing natural language inputs, or prompts. This is possible because LLMs are trained on massive datasets, which gives them an understanding of an expansive range of information.

All this information processing requires enormous computational resources. The larger the AI model, the higher the cost of training, compute power, and energy, to say nothing of the downstream maintenance costs. OpenAI’s GPT-4, for example, reportedly cost more than $100 million to train. Each parameter adds to the price tag, and that cost is multiplied by every piece of input data, known as a token. That’s why even seemingly straightforward tasks, like asking an AI “What is the capital of Germany?”, are resource-intensive and expensive.
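The way parameters and tokens multiply can be made concrete with a rough back-of-the-envelope calculation. The sketch below uses a common rule of thumb (about 2 floating-point operations per parameter per token for one forward pass of a dense model) and ignores attention overhead, batching, and hardware efficiency. The model sizes and token count are assumptions chosen for illustration, not figures from any specific product.

```python
def forward_flops(num_parameters: int, num_tokens: int) -> int:
    """Rough compute for processing num_tokens tokens.

    Rule of thumb (an approximation): ~2 FLOPs per parameter per token.
    """
    return 2 * num_parameters * num_tokens


# Hypothetical model sizes, chosen only for illustration.
SMALL_MODEL_PARAMS = 1_000_000_000      # 1 billion parameters
LARGE_MODEL_PARAMS = 100_000_000_000    # 100 billion parameters

# Assumed token count for a short question plus a short answer.
TOKENS = 20

small = forward_flops(SMALL_MODEL_PARAMS, TOKENS)
large = forward_flops(LARGE_MODEL_PARAMS, TOKENS)
ratio = large // small

print(f"small model: {small:.1e} FLOPs")
print(f"large model: {large:.1e} FLOPs")
print(f"the large model needs about {ratio}x the compute for the same question")
```

Under these assumptions the compute scales linearly with parameter count, so a model with 100 times the parameters needs roughly 100 times the work to answer the same short question.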

To put it simply, in many cases, general-purpose LLMs with hundreds of billions of parameters are overkill for business users who need help with specific tasks.

“Parameter count is just one of many variables that determine how well an AI deployment can solve problems in the real world,” said Silvio Savarese, executive vice president and chief scientist at Salesforce.

(For a deeper look at when LLM scale is, and isn’t, necessary, check out this Q&A with Savarese)

Further, LLMs require huge, high-quality datasets. Acquiring and preprocessing that data can be time-consuming and very expensive. Training on it adds even more effort and expense: You have to make sure the data is diverse, and that it represents the population it will affect. The cost of setting up and maintaining the required infrastructure (such as cloud computing and specialized hardware) can also be extremely high.

How are small language models different?

Small, highly trained, task-specific models may be a better option for many companies, regardless of their size. Here’s why: 

Lower cost to serve

LLMs are power-hungry and resource-intensive. Small language models require power and resources, too, but because the pool of data they draw from is much smaller and more task-specific, the system requirements (and the ultimate costs) are far lower. And because small models require far fewer compute resources, they consume less power and water than general-purpose models, which helps reduce both their cost and their impact on the environment.

Silvio Savarese, executive vice president and chief scientist at Salesforce, debunks some myths about small language models.

Better performance

Generative AI relevance, or the degree to which AI outputs are useful, applicable, and aligned to specific business needs, is a vexing business challenge. Business users need clear solutions to specific queries, not the kitchen sink.

As Savarese wrote in this article, “There’s no substitute for hundreds of billions of parameters when you want to be everything to everyone. But in the enterprise, this ability is almost entirely moot.”

With the right strategy, small language models designed for individual, well-defined tasks, like knowledge retrieval or tech support, can easily outperform larger models.  

Small, open-source models like Salesforce’s xGen consistently exceed the performance of larger models by leveraging better pre-training and data curation strategies. xGen, for example, is trained on longer sequences of data, helping it summarize large volumes of text, write code, and more. 

Greater accuracy

A model’s accuracy depends on the quality and quantity of the data it’s trained on. LLMs are trained on oceans of data pulled from all over the internet, and much of that data is irrelevant to the business user’s task at hand. By contrast, small language models like xGen are trained on business data that looks similar to the customer relationship management (CRM) data that a customer might have.

“xGen is narrowly focused on these specific tasks, and it’s very good at it,” said Kathy Baxter, principal architect, ethical AI practice at Salesforce. 

The models’ small size results in a more focused learning process: They adapt faster to the nuances of particular datasets or applications. This matters for companies looking for specialized AI capabilities, because a model that adapts quickly to a narrow domain handles that domain’s tasks better.

How do small language models enable on-device AI?

Business users on the go can use their phones to access LLMs that live in the cloud. But there are key issues. You need an internet connection, and the performance is only as good as that connection. 

What if you had a small language model that lives on your phone, and works even when you’re offline? Salesforce Research is working on this very thing, with xGen-Mobile, which is tiny enough to fit on a phone but powerful enough to perform tasks accurately and quickly.
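To see why model size decides what fits on a phone, it helps to estimate how much memory the weights alone occupy. The sketch below is simple arithmetic (parameters times bits per parameter). The parameter counts and precisions are hypothetical examples, not published figures for xGen-Mobile or any other model, and the estimate ignores activations and runtime overhead.

```python
def weight_memory_gb(num_parameters: int, bits_per_parameter: int) -> float:
    """Approximate memory needed just to hold the model weights, in gigabytes."""
    total_bytes = num_parameters * bits_per_parameter / 8
    return total_bytes / 1e9


# Hypothetical model sizes, chosen only for illustration.
HYPOTHETICAL_MODELS = {
    "small (1B parameters)": 1_000_000_000,
    "large (100B parameters)": 100_000_000_000,
}

for name, params in HYPOTHETICAL_MODELS.items():
    for bits in (16, 4):  # 16-bit weights vs. 4-bit quantized weights
        size = weight_memory_gb(params, bits)
        print(f"{name} at {bits}-bit: about {size:g} GB")
```

Under these assumptions, a 1-billion-parameter model stored at 4 bits per weight needs about half a gigabyte, which is plausible for a modern phone, while a 100-billion-parameter model at 16 bits needs about 200 GB.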

This demo shows how a field service technician could use on-device AI to diagnose and solve a customer’s problems without internet connectivity.

The first iterations will be geared toward field service and field sales. In field service, picture a technician diagnosing a washing machine problem onsite. Internet connectivity may be spotty or non-existent in, say, a basement, but that’s not a problem. The technician could access the small language model stored on their device, and instantly get answers to repair questions. 

Future iterations of xGen-Mobile will support multimodal capabilities. For example, if the technician takes a picture of a greasy, broken part, the model would recognize it, making it easy to order a new part. By snapping a picture or even recording sound, the tech could get recommendations for the most likely issues, and ways to address them. 

Another benefit? Keeping the computation on the device can save costs by not sending data to process in the cloud. Further, you can ground the model in the data that’s on your device, and personalize it to your needs.  

“These models can be grounded in information on an individual’s device,” Baxter said. “That means they will eventually be highly personalized, which will make them even more valuable.” 
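Grounding can be pictured as looking up relevant information stored on the device and placing it in the prompt before the model answers. The following is a minimal sketch of that idea using plain keyword overlap. The notes, function names, and prompt layout are all hypothetical, and it does not represent how xGen-Mobile or any Salesforce product actually works.

```python
import re

# Hypothetical notes stored locally on a technician's device.
LOCAL_NOTES = [
    "Washer model W200: error code E21 means the drain pump is blocked.",
    "Dryer model D100: replace the lint filter every six months.",
]


def tokenize(text: str) -> set[str]:
    """Lowercase the text and split it into a set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(question: str, documents: list[str], top_k: int = 1) -> list[str]:
    """Return the top_k documents sharing the most words with the question."""
    question_words = tokenize(question)
    ranked = sorted(
        documents,
        key=lambda doc: len(question_words & tokenize(doc)),
        reverse=True,
    )
    return ranked[:top_k]


def build_prompt(question: str, documents: list[str]) -> str:
    """Place the retrieved local context ahead of the question."""
    context = "\n".join(retrieve(question, documents))
    return f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"


question = "What does error code E21 mean on the washer?"
print(build_prompt(question, LOCAL_NOTES))
```

The resulting prompt would then be passed to whatever model runs on the device. A production system would likely use a stronger retrieval method than word overlap, but the flow (retrieve locally, then prompt) stays the same.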

Stronger data privacy

Unlike some external API-based sources, small language models like xGen adhere to stringent data privacy controls, in line with Salesforce’s policy of keeping customer data inside its own secured platform. xGen preserves privacy better because the model runs on a mobile device, where the data lives. This is a good solution for sensitive, regulated industries like banking and healthcare, which are restricted in how and with whom they can share information.

Small but mighty

Small models can be fine-tuned to specific tasks or industries, giving you more relevant and precise outputs without the overhead of processing unnecessary information. This makes them perfect for applications where speed, cost, and accuracy are crucial, delivering specific solutions without the heavyweight footprint.
