Palmyra X5, a model developed to efficiently drive multi-step agents, is now available exclusively through Writer and Amazon Bedrock in a fully managed manner. Amazon Cloud Technologies has announced the official launch of Palmyra X5 – a new adaptive inference model with a million token context window – on Amazon Bedrock. The model, published by Writer, a leader in enterprise-grade generative AI, is one of the first to offer such a large-scale context window on Amazon Bedrock. The model is optimized for speed and cost-efficiency, enabling customers to build advanced multi-step AI agents that can precisely process large amounts of enterprise data, fundamentally changing the way inference is done. Amazon Cloud Technology is now the first and only cloud provider to offer Writer’s fully hosted, serverless model, including the latest Palmyra X5 and Palmyra X4, with more models coming soon. As generative AI technology accelerates, customers need a wide selection of models to precisely match their business needs. The launch of the Writer model in Amazon Bedrock further enriches Amazon Bedrock’s extensive selection of fully managed models from leading AI companies, helping customers more easily and securely build and scale generative AI applications to drive business transformation and innovation. Palmyra X5 is one of the first models to offer a million token context window on Amazon Bedrock, providing Amazon Cloud Technology customers with more choice (context window refers to the amount of information a model can process and “remember” per input/request. It is measured in token, the smallest text unit processed by the model, which can be regarded as the “short-term memory” of the model). With a context window of this size, Palmyra can accurately process 1,500 pages of content (equivalent to 6 books). The model is also one of the first enterprise-grade adaptive inference models in the industry, combining advanced large language model capabilities with extended memory and processing capabilities. Enterprises can now handle a wide range of tasks within their budgets, including financial reporting, legal contract analysis, medical record consolidation, customer feedback mining, and more. As well as reasoning, the Palmyra X5 has a number of powerful features, including support for agents interacting with the system, advanced code generation and deployment, and support for more than 30 languages. Description of Palmyra X5: If you anthropomorphize the Palmyra X5 model, it has superpowers – it can read a million words in 22 seconds and generate actionable insights on the fly. Not only can it memorize the entire 200-page strategy document in its entirety, but it can also understand its intrinsic relevance to yesterday’s client meeting and last quarter’s financial data. When faced with complex problems, it can systematically and incrementally advance solutions, articulating a clear thinking path throughout – whether it is helping to analyze massive customer feedback to distill commonalities, or troubleshooting technical glitches. Waseem AlShikh, chief technology officer and co-founder of Writer, said: “We chose Amazon Cloud Technology as the first mainstream Cloud as a Service provider to offer Writer’s fully managed model because of its unparalleled security and a strong fit in our vision to transform the way AI is used in enterprises and drive innovative growth. The Palmyra X5 is Writer’s most advanced model to date, capable of processing massive amounts of enterprise data at high speed, which is critical for scaling multi-agent systems. With Amazon Bedrock, we are bringing these powerful capabilities to more businesses around the world, helping customers deploy in a secure, scalable environment.” Atul Deo, Director of Amazon Bedrock, Amazon Cloud Technology, said: “Based on our deep strategic partnership with Writer, we are excited to offer Writer’s Palmyra family of models through Amazon Bedrock, empowering enterprises to usher in a new era of intelligent agent innovation. Palmyra X5 delivers superior performance in a long context window with enterprise-grade reliability and speed. Palmyra X5, seamlessly connected to Writer, will enable developers and enterprises to leverage the security, scalability and performance of Amazon Cloud Technology to build and scale AI agents that revolutionize the inference paradigm for massive enterprise data.” Data parsing: Palmyra X5 is one of the most efficient large-scale contextual big language models, optimized for speed and cost. It can process the full million token prompt words in about 22 seconds, and a single function call response takes only about 0.3 seconds. In the latest Longbench v2 review, Palmyra X5 demonstrated its class-leading price/performance ratio with an average score of 53%. Enterprises can achieve near-top accuracy while significantly reducing the cost per million tokens. It can perform a large number of agents and long context processing tasks under controlled budgets. Supports more than 30 languages, providing true multilingual processing power to enterprises worldwide. Priced at $0.60 per million input tokens and $6 per million output tokens, it is one of the most cost-effective large-scale contextual large language models available. In the BigCodeBench (full version, command version) evaluation, Palmyra X5 ranked 48.7 out of the top model, demonstrating its ability to solve practical and challenging complex programming tasks. While generative AI is changing the way we create, analyze, and interact with information, Agentic AI will fundamentally reshape the nature of work. This new frontier in AI goes beyond content generation and insight refining to AI agents that can autonomously plan, execute, and adjust complex action sequences. With Palmyra X5 from Amazon Bedrock, Amazon Cloud Technology customers can use Writer’s models to build and scale AI agents securely and privately, without managing the underlying infrastructure. In addition, the most exciting aspect of Palmyra X5 for companies across industries is the ability to build and deploy more sophisticated AI agents that can process vast amounts of data and interact with other agents, big language models, and external system tools. Writer provides accurate and fully autonomous models that eliminate post-training quantization and knowledge distillation, ensuring that the behavioral patterns validated today are consistent with those of tomorrow. Palmyra X5 builds on this commitment to strengthen technology, maintaining strict backward compatibility to avoid the pain of repeated team tuning processes, publishing a publicly available enterprise technology roadmap that customers can participate in, and optimizing inference latency to enable near-instantaneous responses to large language model interaction and retrieval enhancement generation (RAG), even at the order of millions of tokens. Writer announced that thanks to its innovative Transformer design (an architecture that supports input data parallelism rather than sequential processing) and hybrid attention mechanism (which allows multiple ways to focus on information simultaneously, ensuring both efficiency and effectiveness), all of its future big language models will be released with one million tokens as the minimum context window size. This means that enterprises can develop long-term strategies based on continuously expanding AI capabilities, without being limited by the size constraints of the context window. Visit the Amazon Cloud Technology News Blog for details on Palmyra X5, including the model’s deployment approach and potential use cases in Amazon Bedrock, and check out the Writer product page in Amazon Bedrock. Visit the web link now {Amazon Bedrock Console} Start using Palmyra X5 and Palmyra X4 About Amazon Cloud Technology Since 2006, Amazon Web Services has been renowned for its technological innovation, rich service offerings, and wide range of applications. Amazon Cloud Technology has been continuously expanding its portfolio of services to support almost any workload on the cloud, and currently offers more than 240 full-featured services, covering compute, storage, database, networking, data analytics, machine learning and artificial intelligence, Internet of Things, mobile, security, hybrid cloud, media, and application development, deployment, and management; infrastructure spans 114 Availability Zones in 36 geographic regions, and has announced plans for 4 new regions and 12 new Availability Zones, including New Zealand and Saudi Arabia. Millions of customers around the world, including fast-growing startups, large enterprises, and leading government entities, rely on Amazon Cloud Technology to support their infrastructure, improve agility, and reduce costs through Amazon Cloud Technology’s services. To learn more about Amazon Cloud Technology
发表回复