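Kichijoji Kitaguchi System (吉祥寺北口システム) clips articles that caught our attention. Each title links to the original article. Tags are generated by morphological analysis of the article title. We occasionally add comments.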

Inference

How we built the most efficient inference engine for Cloudflare’s network
- built(44)
- CloudFlare(712)
- efficient(16)
- Engine(264)
- for(5515)
- how(326)
- Inference(33)
- most(81)
- Network(416)
- the(4555)
- We(175)
How we used OpenBMC to support AI inference on GPUs around the world
- ai(6231)
- around(34)
- GPUs(13)
- how(326)
- Inference(33)
- on(1950)
- OpenBMC(2)
- Support(669)
- the(4555)
- to(3435)
- Used(60)
- We(175)
- world(304)
Announcing Llama 2 Inference APIs and Hosted Fine-Tuning through Models-as-a-Service in Azure AI – Microsoft Community Hub
- ai(6231)
- and(3481)
- Announcing(458)
- Apis(43)
- As(319)
- Azure(743)
- Community(378)
- Fine(11)
- hosted(16)
- Hub(216)
- in(2587)
- Inference(33)
- Llama(32)
- Microsoft(4534)
- Models(59)
- Service(856)
- through(104)
- tuning(5)
Workers AI: serverless GPU-powered inference on Cloudflare’s global network
- ai(6231)
- CloudFlare(712)
- Global(362)
- GPU(235)
- Inference(33)
- Network(416)
- on(1950)
- powered(257)
- Serverless(107)
- workers(166)
The best place on Region: Earth for inference
- best(81)
- Earth(91)
- for(5515)
- Inference(33)
- on(1950)
- place(57)
- Region(30)
- the(4555)
GitHub – facebookresearch/codellama: Inference code for CodeLlama models
- Code(453)
- codellama(1)
- facebookresearch(3)
- for(5515)
- GitHub(1016)
- Inference(33)
- Models(59)
How Cloudflare runs machine learning inference in microseconds
- CloudFlare(712)
- how(326)
- in(2587)
- Inference(33)
- Learning(126)
- Machine(120)
- microseconds(1)
- Runs(13)
Announcing the Microsoft Machine Learning Membership Inference Competition (MICO) – Microsoft Security Response Center
- Announcing(458)
- Center(775)
- Competition(24)
- Inference(33)
- Learning(126)
- Machine(120)
- Membership(2)
- MICO(1)
- Microsoft(4534)
- response(329)
- Security(5841)
- the(4555)
Amazon SageMaker で NVIDIA Triton Inference Server を使用してモデルサーバのハイパースケールパフォーマンスを実現する | Amazon Web Services ブログ
- Amazon(8960)
- Inference(33)
- NVIDIA(256)
- SageMaker(373)
- Server(738)
- Services(7017)
- Triton(4)
- Web(9895)
- サーバ(807)
- スケール(113)
- ハイパー(90)
- パフォーマンス(351)
- ブログ(8419)
- モデル(1257)
- 使用(2402)
- 実現(3361)
Amazon SageMaker の NVIDIA Triton Inference Server を使用して高速でスケーラブルな AI をデプロイする | Amazon Web Services ブログ
- ai(6231)
- Amazon(8960)
- Inference(33)
- NVIDIA(256)
- SageMaker(373)
- Server(738)
- Services(7017)
- Triton(4)
- Web(9895)
- デプロイ(188)
- ブログ(8419)
- 使用(2402)
- 高速(842)
Amazon SageMaker Serverless Inference — サーバーレスで推論用の機械学習モデルをデプロイ可能に | Amazon Web Services ブログ
- Amazon(8960)
- Inference(33)
- SageMaker(373)
- Serverless(107)
- Services(7017)
- Web(9895)
- サーバー(1187)
- デプロイ(188)
- ブログ(8419)
- モデル(1257)
- レス(409)
- 可能(4298)
- 学習(843)
- 推論(71)
- 機械(475)
Amazon SageMaker Inference Recommender を発表 | Amazon Web Services ブログ
- Amazon(8960)
- Inference(33)
- Recommender(1)
- SageMaker(373)
- Services(7017)
- Web(9895)
- ブログ(8419)
- 発表(8396)
[2110.06037] SoftNeuro: Fast Deep Inference using Multi-platform Optimization
- 2110.06037(1)
- Deep(182)
- Fast(95)
- Inference(33)
- MULTI-PLATFORM(3)
- optimization(29)
- SoftNeuro(2)
- using(225)
Android Developers Blog: Announcing Android’s updateable, fully integrated ML inference stack
- Android(2174)
- Android’s(5)
- Announcing(458)
- Blog(6570)
- Developers(406)
- FULLY(30)
- Inference(33)
- Integrated(23)
- ML(99)
- Stack(117)
- updateable(1)
Listen to Your Key: Towards Acoustics-based Physical Key Inference
- Acoustics-based(1)
- Inference(33)
- Key(87)
- Listen(11)
- physical(12)
- to(3435)
- Towards(15)
- Your(582)
Amazon Elastic Inference で PyTorch モデル向け Amazon EC2 の推論コストを削減する | Amazon Web Services ブログ
- Amazon(8960)
- EC(1498)
- Elastic(147)
- Inference(33)
- PyTorch(24)
- Services(7017)
- Web(9895)
- コスト(641)
- ブログ(8419)
- モデル(1257)
- 削減(699)
- 推論(71)
Amazon SageMaker Neo と Amazon Elastic Inference を使用してパフォーマンスを向上させ、MXNet 推論のコストを削減する | Amazon Web Services ブログ
- Amazon(8960)
- Elastic(147)
- Inference(33)
- MXNet(37)
- NEO(70)
- SageMaker(373)
- Services(7017)
- Web(9895)
- コスト(641)
- パフォーマンス(351)
- ブログ(8419)
- 使用(2402)
- 削減(699)
- 向上(1544)
- 推論(71)
Amazon Elastic Inference を使用して Amazon SageMaker で PyTorch モデルの ML 推論コストを削減する | Amazon Web Services ブログ
- Amazon(8960)
- Elastic(147)
- Inference(33)
- ML(99)
- PyTorch(24)
- SageMaker(373)
- Services(7017)
- Web(9895)
- コスト(641)
- ブログ(8419)
- モデル(1257)
- 使用(2402)
- 削減(699)
- 推論(71)
Kubernetes および Amazon Elastic Inference を使用した TensorFlow モデルの最適化 | Amazon Web Services ブログ
- Amazon(8960)
- Elastic(147)
- Inference(33)
- Kubernetes(337)
- Services(7017)
- TensorFlow(48)
- Web(9895)
- ブログ(8419)
- モデル(1257)
- 使用(2402)
- 最適化(488)
Apache MXNet、AWS Lambda、Amazon Elastic Inference を使って深層学習を提供している Curalate 社 | Amazon Web Services ブログ
- Amazon(8960)
- apache(544)
- AWS(4340)
- Curalate(2)
- Elastic(147)
- Inference(33)
- Lambda(217)
- MXNet(37)
- Services(7017)
- Web(9895)
- ブログ(8419)
- 学習(843)
- 提供(16089)
- 深層(70)
Amazon ECS で Amazon Elastic Inference ワークロードを実行する | Amazon Web Services ブログ
- Amazon(8960)
- ECS(112)
- Elastic(147)
- Inference(33)
- Services(7017)
- Web(9895)
- ブログ(8419)
- ロード(239)
- ワーク(1192)
- 実行(943)
Amazon TensorFlow を使用した Amazon Elastic Inference でのコストの最適化 | Amazon Web Services ブログ
- Amazon(8960)
- Elastic(147)
- Inference(33)
- Services(7017)
- TensorFlow(48)
- Web(9895)
- コスト(641)
- ブログ(8419)
- 使用(2402)
- 最適化(488)
MXNet と Amazon Elastic Inference を使った Java ベースの深層学習の実行 | Amazon Web Services ブログ
- Amazon(8960)
- Elastic(147)
- Inference(33)
- Java(530)
- MXNet(37)
- Services(7017)
- Web(9895)
- ブログ(8419)
- ベース(644)
- 学習(843)
- 実行(943)
- 深層(70)
EC2 用の Amazon Elastic Inference 設定ツールを使用して、EI アクセラレータを数分で起動する | Amazon Web Services ブログ
- Amazon(8960)
- EC(1498)
- EI(3)
- Elastic(147)
- Inference(33)
- Services(7017)
- Web(9895)
- アクセラレータ(24)
- ツール(2864)
- ブログ(8419)
- 使用(2402)
- 数分(23)
- 設定(905)
- 起動(182)
MXNet と Amazon Elastic Inference を使用した、深層学習の推論コストの削減 | Amazon Web Services ブログ
- Amazon(8960)
- Elastic(147)
- Inference(33)
- MXNet(37)
- Services(7017)
- Web(9895)
- コスト(641)
- ブログ(8419)
- 使用(2402)
- 削減(699)
- 学習(843)
- 推論(71)
- 深層(70)
Amazon Elastic Inference を使ったモデルサービング | Amazon Web Services ブログ
- Amazon(8960)
- Elastic(147)
- Inference(33)
- Services(7017)
- Web(9895)
- サー(99)
- ビング(2)
- ブログ(8419)
- モデル(1257)
Amazon Elastic Inference を使用して ONNX モデルを実行する | Amazon Web Services ブログ
- Amazon(8960)
- Elastic(147)
- Inference(33)
- ONNX(9)
- Services(7017)
- Web(9895)
- ブログ(8419)
- モデル(1257)
- 使用(2402)
- 実行(943)
EI 対応の TensorFlow 1.12 で利用できる柔軟性のある新型 Python API を使用して、Amazon Elastic Inference で TensorFlow モデルをデプロイする | Amazon Web Services ブログ
- 1.12(2)
- Amazon(8960)
- API(1145)
- EI(3)
- Elastic(147)
- Inference(33)
- Python(181)
- Services(7017)
- TensorFlow(48)
- Web(9895)
- デプロイ(188)
- ブログ(8419)
- モデル(1257)
- 使用(2402)
- 利用(5318)
- 対応(5118)
- 新型(1388)
- 柔軟性(20)
インテル、機械学習の推論に特化した新プロセッサ「Nervana Neural Network Processor for Inference」発表。Facebookが開発に協力 － Publickey
- Facebook(1704)
- for(5515)
- Inference(33)
- Nervana(2)
- Network(416)
- neural(23)
- processor(32)
- Publickey(3052)
- インテル(238)
- プロセッサ(160)
- 協力(407)
- 学習(843)
- 推論(71)
- 機械(475)
- 特化(616)
- 発表(8396)
- 開発(6923)
Amazon Elastic Inference — GPUを利用した深層学習推論の高速化 | Amazon Web Services ブログ
- Amazon(8960)
- Elastic(147)
- GPU(235)
- Inference(33)
- Services(7017)
- Web(9895)
- ブログ(8419)
- 利用(5318)
- 学習(843)
- 推論(71)
- 深層(70)
- 高速(842)
Apache MXNet を Amazon SageMaker および AWS Greengrass ML Inference と共に使用する脳組織のセグメント化 – パート 2 | Amazon Web Services ブログ
- Amazon(8960)
- apache(544)
- AWS(4340)
- Greengrass(36)
- Inference(33)
- ML(99)
- MXNet(37)
- SageMaker(373)
- Services(7017)
- Web(9895)
- セグメント(63)
- パート(109)
- ブログ(8419)
- 使用(2402)
- 脳組織(3)
Apache MXNet を Amazon SageMaker および AWS Greengrass ML Inference と共に使用する脳組織のセグメント化 – パート 1 | Amazon Web Services ブログ
- Amazon(8960)
- apache(544)
- AWS(4340)
- Greengrass(36)
- Inference(33)
- ML(99)
- MXNet(37)
- SageMaker(373)
- Services(7017)
- Web(9895)
- セグメント(63)
- パート(109)
- ブログ(8419)
- 使用(2402)
- 脳組織(3)
AWS Greengrass Machine Learning Inference – アマゾンウェブサービス
- AWS(4340)
- Greengrass(36)
- Inference(33)
- Learning(126)
- Machine(120)
- アマゾン(484)
- ウェブ(1051)
- サービス(19742)