Deploying BGE-M3 and other embedding models on Azure Machine Learning with Managed Online Endpoints22 October 2024·5 mins
Sharing Azure OpenAI Provisioned Throughput (PTU) for multiple use cases with Azure API Management25 June 2024·12 mins
Understanding Azure OpenAI's x-ratelimit-remaining-tokens and x-ratelimit-remaining-requests headers5 June 2024·7 mins