{"id":2145,"date":"2025-03-11T16:45:00","date_gmt":"2025-03-11T08:45:00","guid":{"rendered":"https:\/\/tongwing.woon.sg\/blog\/?p=2145"},"modified":"2025-03-11T16:58:08","modified_gmt":"2025-03-11T08:58:08","slug":"deepseek-r1-now-available-as-a-fully-managed-serverless-model-in-amazon-bedrock-aws-news-blog","status":"publish","type":"post","link":"https:\/\/tongwing.woon.sg\/blog\/deepseek-r1-now-available-as-a-fully-managed-serverless-model-in-amazon-bedrock-aws-news-blog\/","title":{"rendered":"DeepSeek-R1 now available as a fully managed serverless model in Amazon Bedrock"},"content":{"rendered":"<p><a href=\"https:\/\/tongwing.woon.sg\/blog\/cheapest-way-to-run-deepseek-r1-in-aws\/\">In my previous writeup<\/a>, I wrote that you had to either spend a lot (on GPU instances) or put up with very slow performance (on CPU) if you wanted to run DeepSeek-R1 on AWS. Not anymore. AWS now offers DeepSeek-R1 as a serverless base model in Amazon Bedrock, starting 10 Mar 2025 (in selected regions). Check out the AWS blog for a <a href=\"https:\/\/aws.amazon.com\/blogs\/aws\/deepseek-r1-now-available-as-a-fully-managed-serverless-model-in-amazon-bedrock\/\">demo walkthrough<\/a>.<\/p>\n<p><a href=\"https:\/\/aws.amazon.com\/blogs\/aws\/deepseek-r1-now-available-as-a-fully-managed-serverless-model-in-amazon-bedrock\/\"><img decoding=\"async\" class=\"alignnone size-full\" src=\"https:\/\/tongwing.woon.sg\/blog\/wp-content\/uploads\/2025\/03\/2025-deepseek-r1-on-bedrock-1-model-access-1.jpg\" alt=\"\" \/><\/a><\/p>\n<p>Note that you may have to increase the maximum output length for your request to complete &#8211; this applies to most reasoning models. In my test, the output stopped abruptly halfway because the default maximum output length is only 4096 tokens.
Increasing the maximum output length solved the problem.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In my previous writeup, I wrote that you had to either spend a lot (on GPU instances) or put up with very slow performance (on CPU) if you wanted to run DeepSeek-R1 on AWS. Not anymore. AWS now offers DeepSeek-R1 as a serverless base model in Amazon Bedrock, starting 10 Mar 2025 (in selected regions). Check out the AWS [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[34,24],"tags":[],"_links":{"self":[{"href":"https:\/\/tongwing.woon.sg\/blog\/wp-json\/wp\/v2\/posts\/2145"}],"collection":[{"href":"https:\/\/tongwing.woon.sg\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/tongwing.woon.sg\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/tongwing.woon.sg\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/tongwing.woon.sg\/blog\/wp-json\/wp\/v2\/comments?post=2145"}],"version-history":[{"count":4,"href":"https:\/\/tongwing.woon.sg\/blog\/wp-json\/wp\/v2\/posts\/2145\/revisions"}],"predecessor-version":[{"id":2153,"href":"https:\/\/tongwing.woon.sg\/blog\/wp-json\/wp\/v2\/posts\/2145\/revisions\/2153"}],"wp:attachment":[{"href":"https:\/\/tongwing.woon.sg\/blog\/wp-json\/wp\/v2\/media?parent=2145"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/tongwing.woon.sg\/blog\/wp-json\/wp\/v2\/categories?post=2145"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/tongwing.woon.sg\/blog\/wp-json\/wp\/v2\/tags?post=2145"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}