AI agents, multimodal Phi-3 unveiled at Microsoft Build 2024

Satya Nadella used his keynote address on Day 1 of Microsoft’s Build Developer Conference to announce some exciting new AI developments that will soon be generally available.

Microsoft Build is an annual conference where developers get to see the latest developments in Windows 11 and Microsoft 365. The first day saw the unveiling of some interesting generative AI tools.

Team Copilot

In 2023 Microsoft launched its Copilot chatbot, which provides real-time intelligent assistance while you work with Microsoft 365 tools like Word, Excel, PowerPoint, Outlook, or Teams.

Nadella announced that Copilot is getting a significant AI upgrade with Team Copilot. Team Copilot expands Copilot from an individual personal assistant into a member of a team, improving collaboration and project management.

If you’re working as part of a team using Microsoft Teams, Microsoft Loop, or Microsoft Planner, Team Copilot can facilitate meetings by managing the agenda and taking notes. It can highlight important information, track action items, and address unresolved issues.

It can even act as a project manager, assigning tasks, tracking deadlines, and notifying team members when their input is needed.

Custom copilot agents

Microsoft Copilot Studio will enable you to build custom copilots that act as agents, working independently after you give them instructions.

Using a natural language prompt, you simply describe what you want the agent to do and then deploy it on multiple platforms.

Microsoft says these agents can:

  • Automate long-running business processes
  • Reason over actions and user inputs
  • Leverage memory to bring in context
  • Learn based on user feedback
  • Record exception requests and ask for help.

An example of the utility an agent like this could provide is an “order-taker” copilot that Microsoft says could “handle the end-to-end order fulfillment process—from taking the order to processing the order and making intelligent recommendations and substitutions for out-of-stock items to shipping it to the customer.”

This functionality lets you create digital employees to handle menial tasks like monitoring emails, data entry, or other repetitive work without adding to your staff headcount.

Phi-3 Vision

Microsoft has added a 4.2B parameter multimodal model to its Phi-3 family of small language models (SLMs). Phi-3 Vision is a low-cost and low-latency model that has language and vision capabilities and a 128k context window.

These smaller models are aimed at on-device solutions where speed, cost, compute, and internet connectivity constraints make larger models impractical. The Phi-3 SLMs display advanced reasoning abilities and outperform several larger models.

Enabling on-device multimodal reasoning opens up exciting applications in healthcare, education, and agriculture, especially for rural areas with no internet connectivity.

Phi-3 Vision is available to try out. It does a great job of analyzing images, extracting text, and even translation.

Phi-3 Vision benchmark results compared to other AI models. Source: Microsoft

Advanced Paste

Windows 11 now has a smarter way to copy and paste. The new Advanced Paste feature gives you more options for data that you copy to the clipboard. When you press Windows Key + Shift + V, you are presented with options to paste as plain text, as markdown, or as JSON.
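
As a rough illustration of what a paste-as-plain-text or paste-as-JSON transform involves, the Python sketch below converts copied text into each form. The function names and conversion rules are hypothetical and chosen only for illustration; they do not describe how Advanced Paste itself is implemented.

```python
import json
import re


def to_plain_text(copied: str) -> str:
    """Strip a few simple markdown markers (illustrative rules only)."""
    # Remove leading heading markers such as "# " or "## ".
    text = re.sub(r"^#{1,6}\s+", "", copied, flags=re.MULTILINE)
    # Remove bold markers and inline-code backticks.
    text = re.sub(r"\*\*|`", "", text)
    return text.strip()


def to_json(copied: str) -> str:
    """Turn 'key: value' lines into a JSON object; other text becomes a list of lines."""
    lines = [line.strip() for line in copied.splitlines() if line.strip()]
    if lines and all(":" in line for line in lines):
        pairs = (line.split(":", 1) for line in lines)
        return json.dumps({key.strip(): value.strip() for key, value in pairs}, indent=2)
    return json.dumps(lines, indent=2)


if __name__ == "__main__":
    print(to_plain_text("# Title\n**bold** text"))
    print(to_json("name: Phi-3\nsize: 4.2B"))
```

The sketch treats the clipboard content as a plain string; a real tool would also need to handle rich formats such as HTML or images.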

You can also type a description of how you want the copied text to be processed before pasting.

You’ll need an OpenAI API key and credits in your account to use this feature. It simply saves you the trouble of pasting the text into ChatGPT, prompting it to format the text there, and then copying and pasting the result back into your document.