{"id":1071030,"date":"2023-01-11T01:04:00","date_gmt":"2023-01-11T09:04:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-blog-post&#038;p=1071030"},"modified":"2024-09-25T03:29:02","modified_gmt":"2024-09-25T10:29:02","slug":"nni-pruning","status":"publish","type":"msr-blog-post","link":"https:\/\/www.microsoft.com\/en-us\/research\/articles\/nni-pruning\/","title":{"rendered":"Compressing Transformers with High Accuracy: A One-Stop Guide to NNI Pruning"},"content":{"rendered":"\n<p>Large AI models have been a hot topic this year in both academia and industry. But with models that routinely have billions of parameters, fine-tuning them with traditional methods is as impractical as fishing for the moon in the water or searching for a needle on the seabed. NNI (Neural Network Intelligence) is a one-stop AutoML (automated machine learning) toolkit that Microsoft Research Asia built for researchers and algorithm engineers. Over the past three years it has been updated continuously, has strengthened its support for a range of distributed training environments, and has become one of the most popular open-source AutoML projects.<\/p>\n\n\n\n<p>Microsoft Research Asia recently released an update to NNI. The latest version integrates many state-of-the-art pruning algorithms, such as TaylorFO Weight and Movement.
Working with classic pre-trained models, the researchers ran extensive experiments and found a combination of pruning steps and algorithms that reduces both parameter count and computation while keeping model accuracy high, achieving pruning results that surpass SOTA.<\/p>\n\n\n\n<p>In this post, we use Transformer-family pre-trained models and the GLUE-MNLI dataset as an example to walk through the NNI pruner workflow and the combination of pruning algorithms used.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"\u526a\u679d\u6d41\u7a0b\">Pruning Workflow<\/h2>\n\n\n\n<p>Before describing the pruning workflow, we first need to understand three terms: pruner, mask, and SpeedUp.<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li>pruner: a pruner object instantiated with a specific pruning algorithm.<\/li>\n\n\n\n<li>mask: during pruning, the pruner generates a mask matrix (initially all ones) with the same shape as the target submodule, and sets to 0 the positions that correspond to the parts of the target submodule to be pruned.
Multiplying the target submodule by its mask matrix then simulates the effect of the pruned model.<\/li>\n\n\n\n<li>SpeedUp: as the description above shows, pruning by itself only replaces the pruned parts with 0. The SpeedUp module physically removes the masked parameters from the target submodule instead of leaving zeros in their place, which actually reduces the parameter count.<\/li>\n<\/ul>\n\n\n\n<p>When pruning with a pruner from the NNI Compression module, users only need to prepare the data and model, construct the pruner, and then prune and retrain the model to build a complete pruning pipeline.<\/p>\n\n\n\n<p>For Transformer-family pre-trained models, the pruning workflow has 4 steps: first prepare the data and model, then prune and retrain the model for the multi-head attention mechanism (Multi-head Attention), the embedding layer (embedding), and the feed-forward network (FFN) in turn.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"228\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-1-1024x228.jpg\" alt=\"Figure 1: Pruning workflow for Transformer-family models\" class=\"wp-image-1071036\" 
srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-1-1024x228.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-1-300x67.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-1-768x171.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-1-1536x341.jpg 1536w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-1-2048x455.jpg 2048w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-1-240x53.jpg 240w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 1: Pruning workflow for Transformer-family models<\/em><\/figcaption><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>1. Prepare the data and model<\/strong><\/p>\n\n\n\n<p>Before building the pruning process, users need to load the pre-trained model, preprocess the data and create the corresponding dataloader, and write the training and evaluation functions that will be used later to train and evaluate the model. The procedure is shown in Figure 2 and consists of 5 steps:<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"752\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-2-1024x752.jpg\" alt=\"Figure 2: Workflow for preparing the data and model\" class=\"wp-image-1071039\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-2-1024x752.jpg 1024w, 
https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-2-300x220.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-2-768x564.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-2-1536x1128.jpg 1536w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-2-2048x1504.jpg 2048w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-2-80x60.jpg 80w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-2-240x176.jpg 240w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 2: Workflow for preparing the data and model<\/em><\/figcaption><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>Concretely, first load the pre-trained model from the Transformers library, then process the GLUE-MNLI data and obtain the corresponding dataloader. Next, build the training and evaluation functions for the model and the GLUE-MNLI dataset. Finally, fine-tune the model on the GLUE-MNLI dataset.<\/p>\n\n\n\n<p>Completing these steps finishes the preparation of the data and model and yields a pre-trained model fine-tuned on the MNLI dataset.
In Transformer-family pre-trained models, the embedding layer accounts for a large share of the parameters, and the encoder and decoder layers contain the multi-head attention mechanism and the feed-forward network. The following steps therefore prune the multi-head attention mechanism, the embedding layer, and the feed-forward network separately, and introduce a dynamic distillation mechanism to retrain the pruned model.<\/p>\n\n\n\n<p><strong>2. Pruning the multi-head attention mechanism and retraining the model with dynamic distillation<\/strong><\/p>\n\n\n\n<p>Pruning the multi-head attention module and retraining the model takes 3 steps, as shown in Figure 3: first construct the pruner, then prune the multi-head attention module, and finally retrain the model with the dynamic distillation mechanism.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"103\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-3-1024x103.jpg\" alt=\"Figure 3: Pruning and retraining workflow for the multi-head attention mechanism\" class=\"wp-image-1071042\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-3-1024x103.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-3-300x30.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-3-768x77.jpg 768w, 
https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-3-1536x154.jpg 1536w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-3-2048x206.jpg 2048w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-3-240x24.jpg 240w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 3: Pruning and retraining workflow for the multi-head attention mechanism<\/em><\/figcaption><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>Before pruning, users need to choose a pruning algorithm and instantiate the corresponding pruner. Every pruning algorithm takes a config_list argument, which defines the names of the operations to prune, the operation types, the sparsity, and so on. The Movement pruning algorithm additionally requires a few other parameters, such as the evaluator parameter, which is used for the training-aware compression process, and the movement_mode parameter, which has two modes, \u201csoft\u201d and \u201chard\u201d. In \u201csoft\u201d mode, the sparsity of the pruned model is hard to control precisely, but the resulting model performs better. The regular_scale parameter controls the pruning sparsity:
the larger regular_scale is, the higher the sparsity of the pruned model. For the other parameters, see https:\/\/nni.readthedocs.io\/zh\/stable\/reference\/compression\/pruner.html#movement-pruner<\/p>\n\n\n\n<p>Next, use the constructed pruner instance to prune the multi-head attention module. Calling pruner.compress() runs the pruning process on the model and returns the pruned model and attention_mask. The attention_mask gives the pruning range for the parameters of each submodule to be pruned: 0 means the position is pruned, and 1 means it is kept.<\/p>\n\n\n\n<p>NNI's SpeedUp module can remove the masked parameters and computation from the model. The removal logic is shown in Figure 4. Take the weight of the Query Linear layer (denoted Q) as an example. Its shape is [768, 768], so the mask matrix for Q's weight also has shape [768, 768]; call it mask. First, the mask matrix is reshaped so that the first dimension is the number of heads, 8, and everything else goes into the second dimension; call the result the reshaped mask matrix. Next, the reshaped mask matrix is summed along the second dimension and each sum is checked against 0.
The mask now has shape [8], with each position corresponding to one head. In this reduced mask, a value of 0 at position i means that head i of Q must be pruned. In the figure, positions 0, 3, and 7 are 0, so heads 0, 3, and 7 of Q must be pruned. Finally, [0,3,7] is passed to the prune_heads function to prune Q. After pruning, the shape of Q is [576, 768]; this size corresponds to a 12-head layer such as BERT-base, where each head spans 768\/12 = 64 rows and removing three heads leaves 9 * 64 = 576 rows (with the 8 heads drawn in the figure, each head would span 96 rows and 480 rows would remain). For a fuller introduction to SpeedUp, see the SparTA paper published at OSDI 2022. In the upcoming NNI 3.0, SpeedUp will provide more complete support for more models.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"395\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-4-1024x395.jpg\" alt=\"Figure 4: Pruning the self-attention module with the prune_heads function\" class=\"wp-image-1071045\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-4-1024x395.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-4-300x116.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-4-768x297.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-4-1536x593.jpg 1536w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-4-2048x791.jpg 2048w, 
https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-4-240x93.jpg 240w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 4: Pruning the self-attention module with the prune_heads function<\/em><\/figcaption><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>After pruning the multi-head attention module, the fine-tuned model serves as the teacher model and the pruned model as the student model. The model is then retrained with the dynamic distillation mechanism borrowed from CoFi [1], which yields a new model. Dynamic distillation here means that the mapping between teacher layers and student layers is not static: at each distillation step, the teacher can dynamically choose which of the student's lower layers receives the information distilled from one of its own higher layers.<\/p>\n\n\n\n<p><strong>3. 
Pruning the embedding layer and the feed-forward network, and retraining the model with dynamic distillation<\/strong><\/p>\n\n\n\n<p>Pruning the embedding layer and the feed-forward network is similar to pruning the multi-head attention module. Here the Taylor pruning algorithm (https:\/\/nni.readthedocs.io\/zh\/stable\/reference\/compression\/pruner.html#taylor-fo-weight-pruner) is used to prune the embedding layer and the feed-forward network. As before, the researchers defined the config_list, evaluator, and taylor_pruner_steps parameters. Because the dimension of the embedding layer is tied to the dimensions used later in the model, the researchers also set the pruning mode parameter, mode, to \u201cdependency-aware\u201d when pruning the embedding layer and passed in a model input, dummy_input, to help the pruner find the submodules whose dimensions depend on the embedding dimension.<\/p>\n\n\n\n<p>Next, the separately constructed pruners are used to prune the feed-forward network and the embedding layer. Unlike the pruning of the multi-head attention module, this stage uses iterative pruning: during retraining with dynamic distillation, the pruners prune
the feed-forward network and the embedding layer once every 2000 steps. In total, the feed-forward network is pruned 19 or 24 times, depending on the experiment, and the embedding layer is pruned 3 times. After each pruning pass, ModelSpeedUp is applied to the feed-forward network layers so that the pruned parameters are actually removed rather than replaced with 0.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"\u5b9e\u9a8c\u7ed3\u679c\">Experimental Results<\/h2>\n\n\n\n<p>By varying the value of regular_scale and the number of feed-forward network pruning passes, the researchers obtained models with different sparsity levels and performance. The experiments ran on a single A100 with batch_size set to 32.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"620\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-5-1024x620.jpg\" alt=\"Figure 5: Experimental results\" class=\"wp-image-1071048\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-5-1024x620.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-5-300x182.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-5-768x465.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-5-1536x930.jpg 1536w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-5-2048x1240.jpg 2048w, 
https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-5-240x145.jpg 240w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 5: Experimental results<\/em><\/figcaption><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<p>The results in the figure above show that:<\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>As regular_scale increases, the overall sparsity of the model increases. When regular_scale is 10 or greater, the overall sparsity exceeds 69% and the performance loss exceeds 1%.<\/li>\n\n\n\n<li>As the number of feed-forward network pruning passes increases, the overall sparsity increases and model performance drops, and the drop grows larger as overall sparsity increases.<\/li>\n\n\n\n<li>Pruning the embedding layer 3 times reduces the model dimension from 768 to 561, which raises the overall sparsity to some extent.<\/li>\n<\/ol>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"\u5b9e\u9a8c\u7ed3\u679c\u4e0e\u5e73\u53f0\u5bf9\u6bd4\">Experimental Results and Platform Comparison<\/h2>\n\n\n\n<p>A closer look at the results shows that BERT pruned with NNI on the MNLI dataset outperforms the nn pruning framework (Figure 6(a)). When the overall sparsity is below 65%, the gap between NNI and CoFi on BERT pruned on MNLI is small;
when the overall sparsity is above 65%, BERT pruned with NNI on MNLI outperforms CoFi. Figures 6(b) and 6(c) show NNI's pruning performance on the T5 and ViT models, respectively. As the figures show, once the sparsity of the pruned part of the model exceeds 75%, performance drops by about 3%; when that sparsity is below 50%, the drop is small.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"432\" height=\"288\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-1.png\" alt=\"diagram\" class=\"wp-image-1071051\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-1.png 432w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-1-300x200.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-1-240x160.png 240w\" sizes=\"auto, (max-width: 432px) 100vw, 432px\" \/><\/figure>\n\n\n\n<p>(a)<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"432\" height=\"288\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-2.png\" alt=\"diagram\" class=\"wp-image-1071054\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-2.png 432w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-2-300x200.png 300w, 
https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-2-240x160.png 240w\" sizes=\"auto, (max-width: 432px) 100vw, 432px\" \/><\/figure>\n\n\n\n<p>(b)<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"432\" height=\"288\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-3.png\" alt=\"diagram\" class=\"wp-image-1071057\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-3.png 432w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-3-300x200.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-6-3-240x160.png 240w\" sizes=\"auto, (max-width: 432px) 100vw, 432px\" \/><\/figure>\n\n\n\n<p>(c)<\/p>\n\n\n\n<p><em>Figure 6: NNI pruning performance on classic pre-trained models<\/em><\/p>\n\n\n\n<p>Table 1 gives a detailed comparison of the three platforms (papers). NNI's Compression module not only comes with complete tutorial examples but also provides the SpeedUp module, which actually reduces the model's parameter count instead of setting the pruned parameters to 0.<\/p>\n\n\n\n<p>NNI also supports mainstream models such as BERT, RoBERTa, GPT, BART, T5, and ViT, and provides 16 state-of-the-art pruning algorithms, including Taylor, Movement, ADMM, Slim, AGP, Activation APoZ, and Activation Mean.
This lets it better meet users' needs and makes it broadly applicable.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"631\" height=\"478\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-7.jpg\" alt=\"table\" class=\"wp-image-1071060\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-7.jpg 631w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-7-300x227.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-7-80x60.jpg 80w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2024\/08\/nni-pruning-7-238x180.jpg 238w\" sizes=\"auto, (max-width: 631px) 100vw, 631px\" \/><figcaption class=\"wp-element-caption\"><em>Table 1: Feature comparison of the platforms (papers)<\/em><\/figcaption><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"\u5c55\u671b\u672a\u6765\">Looking Ahead<\/h2>\n\n\n\n<p>In NNI 3.0, the researchers at Microsoft Research Asia will also introduce a distillation module, giving users a compression tool that combines pruning and distillation, and the SpeedUp module will support Transformer pruning more fully. Stay tuned!<\/p>\n\n\n\n<p>For the complete code and tutorial for the latest version of NNI, see:<br>https:\/\/nni.readthedocs.io\/zh\/stable\/tutorials\/pruning_bert_glue.html<\/p>\n\n\n\n<p>NNI
quick-start video tutorials: https:\/\/space.bilibili.com\/1649051673<\/p>\n\n\n\n<p>Reference:<\/p>\n\n\n\n<p>[1] https:\/\/arxiv.org\/pdf\/2204.00408.pdf<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Large AI models have been a hot topic this year in both academia and industry. But with models that routinely have billions of parameters, fine-tuning them with traditional methods is as impractical as fishing for the moon in the water or searching for a needle on the seabed. NNI (Neural Network Intelligence) is a one-stop AutoML (automated machine learning) toolkit that Microsoft Research Asia built for researchers and algorithm engineers. Over the past three years it has been updated continuously, has strengthened its support for a range of distributed training environments, and has become one of the most popular open-source AutoML projects. Microsoft Research Asia recently released an update to NNI. The latest version integrates many state-of-the-art pruning algorithms, such as TaylorFO Weight and Movement. Working with classic pre-trained models, the researchers ran extensive experiments and found a combination of pruning steps and algorithms that reduces both parameter count and computation while keeping model accuracy high, achieving pruning results that surpass SOTA. 
\u4eca\u5929\u6211\u4eec\u5c31\u4ee5 Transformer \u7cfb\u5217\u7684\u9884\u8bad\u7ec3\u6a21\u578b\u548c\u6570\u636e\u96c6 GLUE-MNLI \u4e3a\u4f8b\uff0c\u4e3a\u5927\u5bb6\u4ecb\u7ecd\u4e00\u4e0b NNI \u7684 pruner \u526a\u679d\u6d41\u7a0b\u548c\u4f7f\u7528\u7684\u526a\u679d\u7b97\u6cd5\u7ec4\u5408\u3002 \u5728\u6b63\u5f0f\u4ecb\u7ecd\u526a\u679d\u6d41\u7a0b\u524d\uff0c\u6211\u4eec\u9700\u8981\u5148\u4e86\u89e3\u4ec0\u4e48\u662f pruner\uff0cmask \u548c SpeedUp\u3002 \u5728\u4f7f\u7528 NNI Compression \u6a21\u5757\u4e2d\u7684 pruner \u8fdb\u884c\u526a\u679d\u64cd\u4f5c\u65f6\uff0c\u7528\u6237\u53ea\u9700\u5b8c\u6210\u6570\u636e\/\u6a21\u578b\u7b49\u7684\u51c6\u5907\u3001pruner \u7684\u6784\u5efa\uff0c\u4ee5\u53ca\u6a21\u578b\u526a\u679d\u548c\u518d\u8bad\u7ec3\uff0c\u5373\u53ef\u4e3a\u6a21\u578b\u6784\u5efa\u4e00\u4e2a\u526a\u679d\u7684 pipeline\u3002 \u4ee5 Transformer \u7cfb\u5217\u7684\u9884\u8bad\u7ec3\u6a21\u578b\u4e3a\u4f8b\uff0c\u5176\u526a\u679d\u6d41\u7a0b\u5171\u5305\u542b4\u6b65\uff1a\u9996\u5148\u51c6\u5907\u6570\u636e\/\u6a21\u578b\u7b49\uff0c\u63a5\u7740\u9488\u5bf9\u591a\u5934\u81ea\u6ce8\u610f\u529b\u673a\u5236\uff08Multi-head Attention\uff09\u3001\u5d4c\u5165\u5c42\uff08embedding\uff09\u548c\u524d\u9988\u795e\u7ecf\u7f51\u7edc\uff08FFN\uff09\u5206\u522b\u526a\u679d\u548c\u518d\u8bad\u7ec3\u6a21\u578b\u3002 1. 
Prepare the data, model, and related components Before building the pruning process itself, users need to load the pretrained model, preprocess the data and create the corresponding dataloader, and design the training and evaluation functions that will later be used to train and evaluate the model. The workflow is shown in Figure 2 and has 5 steps: Specifically, first load the pretrained model from the Transformers library, then process the GLUE-MNLI data and obtain the corresponding dataloader. Next, build the training and evaluation functions for the model and the GLUE-MNLI dataset. Finally, fine-tune the model on the GLUE-MNLI dataset. Completing these steps finishes the preparation of the data, model, and related components and yields the pretrained model on 
[&hellip;]<\/p>\n","protected":false},"author":34512,"featured_media":1086627,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-content-parent":1012650,"msr_hide_image_in_river":0,"footnotes":""},"research-area":[],"msr-locale":[268881],"msr-post-option":[],"class_list":["post-1071030","msr-blog-post","type-msr-blog-post","status-publish","has-post-thumbnail","hentry","msr-locale-zh_cn"],"msr_assoc_parent":{"id":1012650,"type":"lab"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1071030","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-blog-post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/34512"}],"version-history":[{"count":4,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1071030\/revisions"}],"predecessor-version":[{"id":1086630,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-blog-post\/1071030\/revisions\/1086630"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/1086627"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=1071030"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=1071030"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=1071030"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-op
tion?post=1071030"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}