阿里发布万亿参数AI大模型M6，相比英伟达、谷歌算力消耗降八成

赛迪网2021-06-25

6月25日，阿里巴巴达摩院发布“低碳版”巨模型M6，在全球范围内首次大幅降低万亿参数超大模型训练能耗。通过一系列突破性的技术创新，达摩院团队仅使用480卡GPU，即训练出了规模达人类神经元10倍的万亿参数多模态大模型M6，与英伟达、谷歌等海外公司实现万亿参数规模相比，能耗降低超八成、效率提升近11倍。大模型将成下一代人工智能基础设施，在AI界已成共识。与生物体神经元越多往往越聪明类似，参数规模越大...

网页链接

免责声明：本文观点仅代表作者个人观点，不构成本平台的投资建议，本平台不对文章信息准确性、完整性和及时性做出任何保证，亦不对因使用或信赖文章信息引发的任何损失承担责任。

精彩评论

加菲虎
2021-06-26
加菲虎
阿里加油！

发表看法

{"i18n":{"language":"zh_CN"},"isChannel":false,"data":{"share":"https://www.laohu8.com/m/news/2146902052?lang=zh_CN&edition=full","thumbnail":"https://static.tigerbbs.com/f13f9296c3b006f981500ed5d2af766f","is_english":false,"pubTime":"2021-06-25 15:15","share_image_url":"https://static.laohu8.com/9a95c1376e76363c1401fee7d3717173","id":"2146902052","market":"us","top_or_hot":-1,"title":"阿里发布万亿参数AI大模型M6，相比英伟达、谷歌算力消耗降八成","media":"赛迪网","content":"<div>\n<p>6月25日，阿里巴巴达摩院发布“低碳版”巨模型M6，在全球范围内首次大幅降低万亿参数超大模型训练能耗。通过一系列突破性的技术创新，达摩院团队仅使用480卡GPU，即训练出了规模达人类神经元10倍的万亿参数多模态大模型M6，与英伟达、谷歌等海外公司实现万亿参数规模相比，能耗降低超八成、效率提升近11倍。大模型将成下一代人工智能基础设施，在AI界已成共识。与生物体神经元越多往往越聪明类似，参数规模越大...</p>\n\n<a href=\"http://gu.qq.com/resources/shy/news/detail-v2/index.html#/?id=nesSN202106251521127c18e476&s=b\">网页链接</a>\n\n</div>\n","source":"tencent","html":"<!DOCTYPE html>\n<html>\n<head>\n<meta http-equiv=\"Content-Type\" content=\"text/html; charset=utf-8\" />\n<meta name=\"viewport\" content=\"width=device-width,initial-scale=1.0,minimum-scale=1.0,maximum-scale=1.0,user-scalable=no\"/>\n<meta name=\"format-detection\" content=\"telephone=no,email=no,address=no\" />\n<title>阿里发布万亿参数AI大模型M6，相比英伟达、谷歌算力消耗降八成</title>\n<style type=\"text/css\">\na,abbr,acronym,address,applet,article,aside,audio,b,big,blockquote,body,canvas,caption,center,cite,code,dd,del,details,dfn,div,dl,dt,\nem,embed,fieldset,figcaption,figure,footer,form,h1,h2,h3,h4,h5,h6,header,hgroup,html,i,iframe,img,ins,kbd,label,legend,li,mark,menu,nav,\nobject,ol,output,p,pre,q,ruby,s,samp,section,small,span,strike,strong,sub,summary,sup,table,tbody,td,tfoot,th,thead,time,tr,tt,u,ul,var,video{ font:inherit;margin:0;padding:0;vertical-align:baseline;border:0 }\nbody{ font-size:16px; line-height:1.5; color:#999; background:transparent; }\n.wrapper{ overflow:hidden;word-break:break-all;padding:10px; }\nh1,h2{ font-weight:normal; line-height:1.35; margin-bottom:.6em; }\nh3,h4,h5,h6{ line-height:1.35; margin-bottom:1em; }\nh1{ font-size:24px; }\nh2{ font-size:20px; }\nh3{ font-size:18px; }\nh4{ font-size:16px; }\nh5{ font-size:14px; }\nh6{ font-size:12px; }\np,ul,ol,blockquote,dl,table{ margin:1.2em 0; }\nul,ol{ margin-left:2em; }\nul{ list-style:disc; }\nol{ list-style:decimal; }\nli,li p{ margin:10px 0;}\nimg{ max-width:100%;display:block;margin:0 auto 1em; }\nblockquote{ color:#B5B2B1; border-left:3px solid #aaa; padding:1em; }\nstrong,b{font-weight:bold;}\nem,i{font-style:italic;}\ntable{ width:100%;border-collapse:collapse;border-spacing:1px;margin:1em 0;font-size:.9em; }\nth,td{ padding:5px;text-align:left;border:1px solid #aaa; }\nth{ font-weight:bold;background:#5d5d5d; }\n.symbol-link{font-weight:bold;}\n/* header{ border-bottom:1px solid #494756; } */\n.title{ margin:0 0 8px;line-height:1.3;color:#ddd; }\n.meta {color:#5e5c6d;font-size:13px;margin:0 0 .5em; }\na{text-decoration:none; color:#2a4b87;}\n.meta .head { display: inline-block; overflow: hidden}\n.head .h-thumb { width: 30px; height: 30px; margin: 0; padding: 0; border-radius: 50%; float: left;}\n.head .h-content { margin: 0; padding: 0 0 0 9px; float: left;}\n.head .h-name {font-size: 13px; color: #eee; margin: 0;}\n.head .h-time {font-size: 11px; color: #7E829C; margin: 0;line-height: 11px;}\n.small {font-size: 12.5px; display: inline-block; transform: scale(0.9); -webkit-transform: scale(0.9); transform-origin: left; -webkit-transform-origin: left;}\n.smaller {font-size: 12.5px; display: inline-block; transform: scale(0.8); -webkit-transform: scale(0.8); transform-origin: left; -webkit-transform-origin: left;}\n.bt-text {font-size: 12px;margin: 1.5em 0 0 0}\n.bt-text p {margin: 0}\n</style>\n</head>\n<body>\n<div class=\"wrapper\">\n<header>\n<h2 class=\"title\">\n阿里发布万亿参数AI大模型M6，相比英伟达、谷歌算力消耗降八成\n</h2>\n\n<h4 class=\"meta\">\n\n\n2021-06-25 15:15 北京时间&nbsp;&nbsp;&nbsp;<a href=http://gu.qq.com/resources/shy/news/detail-v2/index.html#/?id=nesSN202106251521127c18e476&s=b><strong>赛迪网</strong></a>\n\n\n</h4>\n\n</header>\n<article>\n<div>\n<p>6月25日，阿里巴巴达摩院发布“低碳版”巨模型M6，在全球范围内首次大幅降低万亿参数超大模型训练能耗。通过一系列突破性的技术创新，达摩院团队仅使用480卡GPU，即训练出了规模达人类神经元10倍的万亿参数多模态大模型M6，与英伟达、谷歌等海外公司实现万亿参数规模相比，能耗降低超八成、效率提升近11倍。大模型将成下一代人工智能基础设施，在AI界已成共识。与生物体神经元越多往往越聪明类似，参数规模越大...</p>\n\n<a href=\"http://gu.qq.com/resources/shy/news/detail-v2/index.html#/?id=nesSN202106251521127c18e476&s=b\">网页链接</a>\n\n</div>\n\n\n</article>\n</div>\n</body>\n</html>\n","isBrief":false,"type":0,"news_type":1,"symbol":"NVDA","symbol_name":"英伟达","start_time":0,"source_url":"http://gu.qq.com/resources/shy/news/detail-v2/index.html#/?id=nesSN202106251521127c18e476&s=b","article_id":"2146902052","we_media_id":null,"thumbnails":["https://static.tigerbbs.com/f13f9296c3b006f981500ed5d2af766f"],"rights":{"source":"tencent","url":"http://gu.qq.com/resources/shy/news/detail-v2/index.html#/?id=nesSN202106251521127c18e476&s=b","rn_cache_url":null,"customStyle":"body{padding-top:10px;}#news_title{font-weight:bold;#titleStyle#;}#news_description span{font-size:12px;#descriptionStyle#;}.footer-note{#statement#}","selectors":".mod-LoadTzbdNews, body","filters":".relate-stock, .hot-list, .recom-box, .wx-sou","directOrigin":true},"url":"https://stock-news.laohu8.com/highlight/detail?id=2146902052","pubTimestamp":1624605300,"sourceInfo":{"source_id":"tencent","name":"腾讯"},"weMediaInfo":null,"summary":"6月25日，阿里巴巴达摩院发布“低碳版”巨模型M6，在全球范围内首次大幅降低万亿参数超大模型训练能耗。通过一系列突破性的技术创新，达摩院团队仅使用480卡GPU，即训练出了规模达人类神经元10倍的万亿参数多模态大模型M6，与英伟达、谷歌等海外公司实现万亿参数规模相比，能耗降低超八成、效率提升近11倍。","collect":0,"end_time":0,"defaultTopTitle":"qq.com","property":[],"viewcount":null,"language":"zh","relate_stocks":{"NVDA":"英伟达","BABA":"阿里巴巴","GOOGL":"谷歌A","09988":"阿里巴巴-W","09086":"华夏纳指-U","GOOG":"谷歌","03086":"华夏纳指","QNETCN":"纳斯达克中美互联网老虎指数"},"translate_title":"Ali released the trillion-parameter AI large model M6, which reduced Alphabet's computing power consumption by 80% compared with NVIDIA Corp","themeId":null,"isJumpTheme":false,"ttsUrl":null,"symbols_score_info":{"48392":0.6,"48394":0.6,"48453":0.6,"48454":0.6,"48469":0.6,"48473":0.6,"48522":0.6,"48566":0.6,"48579":0.6,"48580":0.6,"48682":0.6,"48683":0.6,"48687":0.6,"48710":0.6,"NVDA":1,"GOOGL":1,"GOOG":1,"QNETCN":0.6,"BABA":0.6,"03086":0.6,"09988":0.6,"09086":0.6},"content_text":"6月25日，阿里巴巴达摩院发布“低碳版”巨模型M6，在全球范围内首次大幅降低万亿参数超大模型训练能耗。通过一系列突破性的技术创新，达摩院团队仅使用480卡GPU，即训练出了规模达人类神经元10倍的万亿参数多模态大模型M6，与英伟达、谷歌等海外公司实现万亿参数规模相比，能耗降低超八成、效率提升近11倍。大模型将成下一代人工智能基础设施，在AI界已成共识。与生物体神经元越多往往越聪明类似，参数规模越大的AI模型，往往拥有更高的智慧上限，训练大模型或将让人类在探索通用人工智能上更进一步。然而，大模型算力成本也相当高昂，很大程度阻碍了学界、工业界对大模型潜力的深入研究。针对这一难题，达摩院及阿里云等团队改进了MOE（Mixture-of-Experts）框架，创造性地通过专家并行策略，大大扩增了单个模型的承载容量。同时，通过加速线性代数、混合精度训练、半精度通信等优化技术，达摩院团队大幅提升了万亿模型训练速度，且在效果接近无损的前提下有效降低了所需计算资源。相比此前英伟达使用3072 A100 GPU实现万亿参数、谷歌使用2048 TPU实现1.6万亿参数大模型，此次达摩院仅使用480卡V100 32G GPU就实现了万亿模型M6，节省算力资源超80%，且训练效率提升近11倍。同时，达摩院此次发布的M6巨模型，成为国内首个实现商业化落地的多模态大模型。M6拥有超越传统AI的认知和创造能力，擅长绘画、写作、问答，在电商、制造业、文学艺术等诸多领域拥有广泛应用前景。据了解，经过一段时间的试用，M6将作为AI助理设计师正式上岗阿里新制造平台犀牛智造，通过结合潮流趋势进行快速设计、试穿效果模拟，有望大幅缩短快时尚新款服饰设计周期。M6还已应用于支付宝、淘宝等平台，参与跨模态搜索、文案撰写、图片设计等工作。达摩院资深算法专家杨红霞表示，“接下来，M6团队将继续把低碳AI做到极致，推进应用进一步落地，并探索对通用大模型的理论研究。”今年以来，阿里在超大规模预训练模型领域屡出成果。除发布多模态巨模型M6外，阿里巴巴达摩院近期还发布了中文社区领先的语言大模型PLUG，实现了在AI大模型底层技术及应用上的深入布局。","kind":"news","is_publish_news":true,"is_publish_highlight":false,"is_publish_live":false,"is_publish_wemedia":null,"editions":null,"column":"","sentiment":"0","news_tag":"","news_rank":0,"symbols":[],"gpt_button":1},"commentList":[{"id":125132250,"gmtCreate":1624663463770,"gmtModify":1624663463770,"author":{"id":"3494553288861128","authorId":"3494553288861128","name":"加菲虎","avatar":"https://static.laohu8.com/default-avatar.jpg","vip":1,"crmLevel":1,"crmLevelSwitch":0,"idStr":"3494553288861128","authorIdStr":"3494553288861128"},"htmlText":"阿里加油！","listText":"阿里加油！","text":"阿里加油！","images":[],"top":1,"highlighted":1,"essential":1,"paper":1,"likeSize":0,"commentSize":0,"repostSize":0,"link":"https://laohu8.com/post/125132250","repostId":2146902052,"repostType":2,"isVote":1,"likeStatus":false,"favoriteStatus":false,"reportStatus":false,"tweetType":1,"langContent":"CN"}],"isCommentEnd":false,"newsSizeData":{"likeSize":1,"commentSize":1,"repostSize":0,"favoriteSize":1,"likeStatus":false,"favoriteStatus":false},"APP":{"userAgent":"Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)","isDev":false,"isTTM":false,"deviceId":"web-server-community-laohu8-v3","version":"4.29.3","shortVersion":"4.29.3","platform":"web","vendor":"web","appName":"laohu8","isIOS":false,"isAndroid":false,"isTiger":false,"isTHS":false,"isWeiXin":false,"isWeiXinMini":false,"isWeiBo":false,"isQQ":false,"isBaiduSwan":false,"isBaiduBox":false,"isDingTalk":false,"isToutiao":false,"isOnePlus":false,"isHuaWei":false,"isXiaomi":false,"isXiaomiWebView":false,"isOppo":false,"isVivo":false,"isSamsung":false,"isMobile":false},"href":"/m/news/2146902052"}