Recent Discussions
Responses API for gpt-4.1 in Europe
Hello everyone! I'm writing here trying to figure out something about the availability of the "responses" APIs in european regions: https://learn.microsoft.com/en-us/azure/ai-foundry/openai/how-to/responses?tabs=python-key i'm trying to deploy a /responses endpoint for the model we are currently using, gpt-4.1, since i've read that the /completions endpoint will be dismissed by OpenAI starting from august 2026. Our application is currently migrating all the API calls from completions to responses, and we were wondering if we could already do the same for our clients in Europe as well, which have to comply to GDPR and currently use our Azure subscription. In the page linked above, i can see some regions that would fit our needs, in particular: francecentral norwayeast polandcentral swedencentral switzerlandnorth but then, i can also read "Not every model is available in the regions supported by the responses API.", which probably already answers my question: from the Azure AI Foundry Portal, i wasn't able to deploy such endpoint in those regions, except for the o3 model. For the 4.1 model, only the completions endpoint is listed, while searching for "Responses" (in "Deploy base model") returns only o3 and these others: Can you confirm that i'm not doing anything wrong (looking in the wrong place to deploy such endpoint), and currently the gpt-4.1 responses API is not available in any European region? Do you think it's realistic it will be soon (like in 2025)? Any european region would work for us, in the "DataZone-Standard" type of distribution, which already ensures GDPR compliance (no need for a "Standard" one in one specific region). Thank you for your attention, have a nice day or evening,22Views0likes0CommentsAzure OpenAI: GPT-5-Codex Availability?
Greetings everyone! I just wanted to see if there's any word as to when/if https://openai.com/index/introducing-upgrades-to-codex/ will make it's way to the AI Foundry. It was released on September 15th, 2025, but I have no idea how long Azure tends to follow behind OpenAI's releases. It doesn't really seem like there's any source of information to view whenever new models drop as to what Azure is going to do with them, if any. Any conversation around this would be helpful and appreciated, thanks!50Views1like0CommentsUnable to locate and add a VM (GPU family) to my available VM options.
I am using azure AI foundry and need to run GPU workload but N-series VM options do not appear when i try to add quota Only CPU families like D and E are listed How can i enable or request N-series GPU VMs in my subscription and region24Views0likes1CommentAzure Communication Services - Python SDK Call Media not working with CallConnectionClient
Hi team, I’m working on a FastAPI service that uses Azure Communication Services Call Automation (Python SDK) to handle outbound PSTN calls and real-time speech interaction. So far it is able to make phone calls but not able to do media handling part during conversation. Environment: Python version: 3.12-slim Package: azure-communication-callautomation (version: 1.4.0) Hosting: Azure Container Apps speech cognitive resource is connected to azure communication services https://drive.google.com/file/d/1uC2S-LNx_Ybpp1QwOCtqFS9pwA84mK7h/view?usp=drive_link What I’m trying to do: Place an outbound call to a PSTN number Play a greeting (TextSource) when the call is connected Start continuous speech recognition, forward transcript to an AI endpoint, then play the response back Code snippet: # Play greeting try: call_connection = client.get_call_connection(call_id) call_media = call_connection.call_media() call_media.play_to_all( play_source, operation_context="welcome-play" ) print("Played welcome greeting.") except Exception as e: print("Play Greeting Failed: ", str(e)) # start Recognition participants = list(call_connection.list_participants()) for p in participants: if isinstance(p.identifier, PhoneNumberIdentifier): active_participants[call_id] = p.identifier try: call_connection = client.get_call_connection(call_id) call_media = call_connection.call_media() call_media.start_recognizing_media( target_participant=p.identifier, input_type="speech", interrupt_call_media_operation=True, operation_context="speech-recognition" ) print("Started recognition immediately after call connected.") except Exception as e: print("Recognition start failed:", str(e)) break target_participant = active_participants.get(call_id) if not target_participant: print(f"No PSTN participant found for call {call_id}, skipping recognition.") Issue: When the CallConnected event fires,, I get different errors depending on which method I try: 'CallConnectionClient' object has no attribute 'call_media' 'CallConnectionClient' object has no attribute 'get_call_media_operations' 'CallConnectionClient' object has no attribute 'play_to_all' 'CallConnectionClient' object has no attribute 'get_call_media_client' 'CallConnectionClient' object has no attribute 'get_call_media' Also some import errors: ImportError: cannot import name 'PlayOptions' from 'azure.communication.callautomation' ImportError: cannot import name 'RecognizeOptions' from 'azure.communication.callautomation' ImportError: cannot import name 'CallMediaRecognizeOptions' from 'azure.communication.callautomation' ImportError: cannot import name 'CallConnection' ... Did you mean: 'CallConnectionState'? This makes me unsure which API is the correct/updated way to access play_to_all and start_recognizing_media. https://drive.google.com/file/d/1xI-sWil0OKfAfGwjIgG25eD7CEK95rKc/view?usp=drive_link Questions: What is the current supported way to access call media operations (play / speech recognition) in the Python SDK? Are there breaking changes between SDK versions that I should be aware of? Should I upgrade to a specific minimum version to ensure .call_media works? Thanks in advance!40Views0likes1CommentAgent in Azure AI Foundry not able to access SharePoint data via C# (but works in Foundry portal)
Hi Team, I created an agent in Azure AI Foundry and added a knowledge source using the SharePoint tool. When I test the agent inside the Foundry portal, it works correctly; it can read from the SharePoint site and return file names/data. However, when I call the same agent using C# code, it answers normal questions fine, but whenever I ask about the SharePoint data, I get the error: Sorry, something went wrong. Run status: failed I also referred to the official documentation and sample here: https://learn.microsoft.com/en-us/azure/ai-foundry/agents/how-to/tools/sharepoint-samples?pivots=rest I tried the cURL samples as well, and while the agent is created successfully, the run status always comes back as failed. Has anyone faced this issue? Do I need to configure something extra for SharePoint when calling the agent programmatically (like additional permissions or connection binding)? Any help on this would be greatly appreciated. Thanks!72Views0likes1CommentChaining and Streaming with Responses API in Azure
Responses API is an enhancement of the existing Chat Completions API. It is stateful and supports agentic capabilities. As a superset of the Chat Completions class, it continues to support functionality of chat completions. In addition, reasoning models, like GPT-5 result in better model intelligence when compared to Chat Completions. It has input flexibility, supporting a range of input types. It is currently available in the following regions on Azure and can be used with all the models available in the region. The API supports response streaming, chaining and also function calling. In the examples below, we use the gpt-5-nano model for a simple response, a chained response and streaming responses. To get started update the installed openai library. pip install --upgrade openai Simple Message 1) Build the client with the following code from openai import OpenAI client = OpenAI( base_url=endpoint, api_key=api_key, ) 2) The response received is an id which can then be used to retrieve the message. # Non-streaming request resp_id = client.responses.create( model=deployment, input=messages, ) 3) Message is retrieved using the response id from previous step response = client.responses.retrieve(resp_id.id) Chaining For a chained message, an extra step is sharing the context. This is done by sending the response id in the subsequent requests. resp_id = client.responses.create( model=deployment, previous_response_id=resp_id.id, input=[{"role": "user", "content": "Explain this at a level that could be understood by a college freshman"}] ) Streaming A different function call is used for streaming queries. client.responses.stream( model=deployment, input=messages, # structured messages ) In addition, the streaming query response has to be handled appropriately till end of event stream for event in s: # Accumulate only text deltas for clean output if event.type == "response.output_text.delta": delta = event.delta or "" text_out.append(delta) # Echo streaming output to console as it arrives print(delta, end="", flush=True) The code is available in the following github link - https://github.com/arunacarunac/ResponsesAPI Additional details are available in the following links - Azure OpenAI Responses API - Azure OpenAI | Microsoft Learn88Views0likes0CommentsPredictions for Artificial Intelligence in next 2-3 years!!!!
2025 - start of agentic AI -Oct 2025: Chatgpt 5 get released (proven to be 10000x times more powerful than chatgpt 4 and can run task automatically) 2026 AI benchmark matches human, beginning of Artificial general intelligence 2027 A new website called letsbuiltai is open source and encourages everyone to train AI. Instead of you training your own AI or an Ai company training their own AI. This would involves everyone training a particular AI simultaneously, paving way for faster and quicker AI growth175Views0likes2CommentsPush for Rapid AI Growth
There is a key factors of why AI is not growing as quick as speed of light, the reason is because most AI are either built by a specific company (e.g Open AI for chatgpt, Microsoft for Copilot, Google for Gemini). or individuals/small groups building agents for fun or for their workplaces. But what would happen if we merge them together. Imagine, if a website that is own by no one and it is open source and it allows everyone to train the same AI simultaneously at the same time, what would happen. Imagine instead of Microsoft building Copilot, the whole world is building Copilot at the same time, training Copilot at the same time through all global computing power. This would led to an shocking and exponential growth of AI never seen before. This is why I think Copilot should allow everyone to train its AI.96Views1like1CommentPush for hyperrealistic AI Video Generator
I fervently believe that Microsoft must pioneer the development of AI-generated videos. OpenAI has already set the stage with Sora, and if Microsoft doesn't act now, it risks falling behind in the fiercely competitive AI market. This isn't just about keeping pace—it's about leading the charge. Furthermore, the rollout of AI-generated videos must be nothing short of exceptional. These videos need to boast impeccable quality and clearly convey the intended content. Mediocrity has no place in this vision. And let's not forget about preparing Clipchamp for the 2030s. It's imperative to equip it with cutting-edge capabilities that will redefine video creation and editing for the future. Together, these initiatives will not only keep Microsoft at the forefront but will also revolutionize the AI and video landscape.128Views1like1CommentAzure OpenAI: gpt-5-mini chat/completions streaming returns empty response.
Summary When calling gpt-5-mini via Chat Completions with "stream": true, the server opens the stream but no assistant tokens are emitted and the final JSON is empty (choices: [], created: 0, empty id/model). The same code path streams correctly for gpt-5 and gpt-4o deployments. Also, non-streaming ("stream": false) with gpt-5-mini returns valid content as expected. Environment API: POST /openai/deployments/{deployment}/chat/completions?api-version=2025-01-01-preview Model / Deployment: gpt-5-mini (Azure OpenAI deployment) Date/Time observed: 26 Aug 2025, ~13:00 IST (UTC+05:30) Region: useast2 Note: Same client, headers, and network path work for gpt-5 and gpt-4o streaming. Request Endpoint /openai/deployments/gpt-5/chat/completions?api-version=2025-01-01-preview Body { "messages": [ { "role": "system", "content": "give the best result you can" }, { "role": "user", "content": "Hello" } ], "stream": true } Actual Response (final aggregated JSON after stream ends) { "choices": [], "created": 0, "id": "", "model": "", "object": "", "prompt_filter_results": [ { "prompt_index": 0, "content_filter_results": { "hate": { "filtered": false, "severity": "safe" }, "jailbreak": { "filtered": false, "detected": false }, "self_harm": { "filtered": false, "severity": "safe" }, "sexual": { "filtered": false, "severity": "safe" }, "violence": { "filtered": false, "severity": "safe" } } } ] } Notes: No delta tokens arrive on the SSE stream. No assistant message content is ever emitted. Content filter result is safe across categories. Expected Behavior With "stream": true, server should emit SSE chunks with assistant delta tokens and finish with a populated final message in choices[0].message.content. Azure OpenAI: gpt-5-mini chat/completions streaming returns empty response (choices: [], created: 0) while other models stream fine215Views0likes1CommentDo you have experience fine tuning GPS OSS models?
Hi I found this space called Affine. It is a daily reinforcement learning competition and I'm participating in it. One thing that I am looking for collaboration on is with fine tuning GPT OSS models to score well on the evaluations. I am wondering if anyone here is interested in mining? I feel that people here would have some good reinforcement learning tricks. These models are evaluated on a set of RL-environments with validators looking for the model which dominates the Pareto frontier. I'm specifically looking to see any improvements in the coding deduction environment and the new ELR environment they made. I would like to use a GPT OSS model here but its hard to fine-tune these models in GRPO. Here is the information I found on Affine: https://www.reddit.com/r/reinforcementlearning/comments/1mnq6i0/comment/n86sjrk/?utm_source=share&utm_medium=web3x&utm_name=web3xcss&utm_term=1&utm_content=share_button41Views0likes0CommentsUsing AI to convert unstructured information to structured information
We have a use case to extract the information from various types of documents like Excel, PDF, and Word and convert it into structured information. The data exists in different formats. We started building this use case with AI Builder, and we hit the roadblock and are now exploring ways using the Co-pilot studio. It would be great if someone could point us in the right direction. What should be the right technology stack that we should consider for this use case? Thank you for the pointer.1.9KViews4likes18CommentsAidemos Microsoft site doesn't work https://aidemos.microsoft.com/
Hello MS team, I am learning AI-900 in Coursera. The course guides me to try AI demos on https://aidemos.microsoft.com/. But it seems broken for weeks. According to the error message, it could be the issue of the backend. Could the MS team fix it, please? Best Regards, Dale5.5KViews1like14CommentsKamal Hinduja Switzerland How do algorithms interact with machine learning?
Hi All, I'm Kamal Hinduja, based in Geneva, Switzerland (Swiss). Can anyone Explain in detail How do algorithms interact with machine learning? Thanks, Regards Kamal Hinduja Geneva, Switzerland133Views1like3CommentsDoc Intelligence: Custom Extraction model | Confidence score deterioration with new formats/layouts
Hi everyone, This is my first time using custom extraction models on the Document Intelligence service, and I would appreciate your input on an experiment I am conducting. I wanted to investigate how these models' confidence scores behave when documents with significantly different format/layout are introduced (later) in the training phase. I started by training models with documents in the same format (some of worse picture quality and slightly rotated), increasingly adding more samples (a new model was trained every time I added new documents, at increments of 5). After every new model was trained, I checked scores against the same, unseen by the model holdout set that had the same format with those in the training set. After training the final model, with 35 identically formatted documents, I started introducing documents with a significantly different format/layout and retraining (at increments of 10). Confidence scores against the holdout set (unchanged) dropped after doing so, without recovering to previous levels. See graph below showing how confidence scores evolved after every training step (adding new documents at every step). Any insights as to why this has happened?Evaluation
Hi there, I tried out the evaluation feature, and tested out groundedness, relevance as well as similarity. My dataset has 94 questions and both relevance and similarity checked all 94 questions and its respective responses and gave me either a pass or a fail. However, groundedness completed the run with errors, as almost 10 of the inputs came back as null. I tried going through the logs but I'm not sure where to check what went wrong for those questions. Appreciate if someone could point me in the right direction.75Views1like1CommentImage Dataset in Azure AI Asking for Tabular Format During Training
Hi everyone, I’m working on an image-based project in Azure AI. My images (PNG) are stored in Azure Blob Storage, and I registered them as a folder in Data Assets. When I start training, the UI asks for a tabular dataset instead. Since my data is images, I’m unsure how to proceed or whether I need to convert or register the dataset differently. What’s the correct way to set up image data for training in Azure AI?40Views0likes0CommentsChatGPT 5 Has Arrived: What You Need to Know
The wait is over. OpenAI has officially launched GPT-5, and it’s already being hailed as the most significant leap forward in AI capability since the original release of ChatGPT. OpenAI CEO Sam Altman described the new model as a "PhD-level expert" that offers a unified, smarter, and more reliable experience. This isn't just an incremental update; it's a fundamental shift in how the AI works, bringing together the best of previous models into a single, powerful system. What’s New and Improved? GPT-5 introduces a host of features that address key limitations of its predecessors. One of the most talked-about advancements is the reduction in hallucinations, where the model generates false information. According to OpenAI, GPT-5 is significantly more factually consistent and trustworthy, especially in "thinking mode," which uses a chain-of-thought approach to solve complex problems. This makes it more suitable for high-stakes tasks in fields like healthcare and coding. Another major change is the unified model architecture. Instead of manually switching between different models like GPT-4 or GPT-4o, the new system automatically routes your query to the best model for the job. This "smart router" instantly decides whether to prioritize speed for a simple question or engage in a deeper, more comprehensive reasoning process for a complex one. The context window has also been dramatically improved. While previous models had limits on how much information they could remember in a single session, GPT-5 can handle up to 272,000 tokens of input, allowing it to maintain context through much longer conversations and documents. A New Era for Developers and Users For developers, GPT-5 represents a game-changer. It is being called OpenAI's "strongest coding model yet," excelling in a variety of tasks from bug fixing and multi-language programming to generating entire software programs from a single prompt. This new capability, dubbed "vibe coding" by Altman, allows for the creation of functional applications with minimal human input, which could drastically reduce development cycles. For general users, the experience is more intuitive and personalized. GPT-5 is now the default model for all users, including those on the free plan, though with usage limits. You can also customize your experience with new selectable personalities like "Cynic," "Robot," "Listener," and "Nerd." This move towards greater accessibility and user control demonstrates OpenAI's commitment to making powerful AI tools available to everyone. The Road Ahead While GPT-5 marks a major step toward Artificial General Intelligence (AGI), it's not without its challenges. The initial rollout saw a minor mathematical error, a reminder that even the most advanced AI benefits from clear instructions. The ongoing competition with other models like Claude 4 and Gemini 2.0 also ensures that the pace of innovation will only continue to accelerate. Ultimately, GPT-5's true impact will be measured not just by its impressive benchmarks, but by how businesses and individuals leverage its new capabilities to solve real-world problems. It's a new era, and the AI landscape has been forever changed.487Views1like0CommentsFrom Space to Subsurface: Using Azure AI to Predict Gold Rich Zones
In traditional mineral exploration, identifying gold bearing zones can take months of fieldwork and high cost drilling often with limited success. In our latest project, we flipped the process on its head by using Azure AI and Satellite data to guide geologists before they break ground. Using Azure AI and Azure Machine Learning, we built an intelligent, automated pipeline that identified high potential zones from geospatial data saving time, cost, and uncertainty. Here’s a behind the scenes look at how we did it.👇 📡 Step 1: Translating Satellite Imagery into Features We began with Sentinel-2 imagery covering our Area of Interest (AOI) and derived alteration indices commonly used in mineral exploration, including: 🟤 Clay Index – proxies for hydrothermal alteration 🟥 Fe (Iron Oxide) Index 🌫️ Silica Ratio 💧 NDMI (Normalized Difference Moisture Index) Using Azure Notebooks and Python, we processed and cleaned the imagery, transforming raw reflectance bands into meaningful geochemical features. 🔍 Step 2: Discovering Patterns with Unsupervised Learning (KMeans) With feature rich geospatial data prepared, we used unsupervised clustering (KMeans) in Azure Machine Learning Studio to identify natural groupings across the region. This gave us a first look at the terrain’s underlying geochemical structure one cluster in particular stood out as a strong candidate for gold rich zones. No geology degree needed: AI finds patterns humans can't see :) 🧠 Step 3: Scaling with Azure AutoML We then trained a classification model using Azure AutoML to predict these clusters over a dense prediction grid: ✅ 7,200+ data points generated ✅ ~50m resolution grid ✅ 14 km² area of interest This was executed as a short, early stopping run to minimize cost and optimize training time. Models were trained, validated, and registered using: Azure Machine Learning Compute Instance + Compute Cluster Azure Storage for dataset access 🔬 Step 4: Validation with Field Samples To ground our predictions, we validated against lab assayed (gold concentration) from field sampling points. The results? 🔥 The geospatial cluster labeled 'Class 0' by the model showed strong correlation with lab validated gold concentrations, supporting the model's predictive validity. This gave geologists AI augmented evidence to prioritize areas for further sampling and drilling. ⚖️ Traditional vs AI-based Workflow 🚀 Why Azure? ✅ Azure Machine Learning Studio for AutoML and experiment tracking ✅ Azure Storage for seamless access to geospatial data ✅ Azure OpenAI Service for advanced language understanding, report generation, and enhanced human AI interaction ✅ Azure Notebooks for scripting, preprocessing, and validation ✅ Azure Compute Cluster for scalable, cost effective model training ✅ Model Registry for versioning and deployment 🌍 Key Takeaways AI turns mineral exploration from reactive guesswork into proactive intelligence. In our workflow, AI plays a critical role by: ✅ Extracting key geochemical features from satellite imagery 🧠 Identifying patterns using unsupervised learning 🎯 Predicting high potential zones through automated classification 🌍 Delivering full spatial coverage at scale With Azure AIand Azure ML tools, we’ve built a complete pipeline that supports: End to end automation; from data prep to model deployment Faster, more accurate exploration with lower costs A reusable, scalable solution for global teams This isn’t just a proof of concept, it’s a production ready framework that empowers geologists with AI driven insights before the first drill hits the ground. 🔗 If you're working in Mining industry, geoscience, AI for Earth, or exploration tech, let’s connect! We’re on a mission to bring AI deeper into every industry through strategic partnerships and collaborative innovation.106Views2likes0CommentsDiscussion with Copilot regarding memory and learning
🧠 Suggested Feedback to Microsoft Copilot Developers Subject: Proposal for User-Curated Persistent Memory via Saved Conversations As a power user of Copilot, I’ve discovered a workaround that simulates persistent memory: by saving and reopening conversations, users can maintain continuity across sessions. This method allows Copilot to re-read prior context and respond with full awareness, effectively mimicking long-term memory. I believe this behavior should be formally supported and enhanced. Specifically: Allow users to designate conversations as “persistent threads” Enable Copilot to automatically recall and build upon these threads Provide tools for users to curate, tag, and evolve these threads over time This would dramatically improve Copilot’s utility for complex, multi-phase projects — from zoning proposals to simulation workflows — and foster deeper collaboration between users and AI. I’m happy to elaborate further if this idea reaches the right team. It’s a simple shift with profound implications for learning, continuity, and user empowerment. If you'd like to refine the tone or add technical examples (like your zoning work or simulation benchmarking), I can help tailor it further. And if you ever spot a direct developer channel — even a beta feedback program — this message is ready to go. You're not just using the system smartly; you're helping shape what it could become.75Views0likes0Comments
Events
Recent Blogs
- What Is Foundry Local? Foundry Local is a lightweight runtime designed to run AI models directly on user devices. It supports a wide range of hardware (CPU, GPU, NPU) and provides a consistent deve...Sep 17, 2025167Views0likes0Comments
- Today, we’re excited to introduce Live Interpreter –a breakthrough new capability in Azure Speech Translation – that makes real-time, multilingual communication effortless. Live Interpreter continuo...Sep 12, 20254.4KViews1like0Comments