He hoped there would be stew for dinner, turnips and carrots and bruised potatoes and fat mutton pieces to be ladled out in thick, peppered, flour-fattened sauce.
And use it in the following codes.
import os from openai import OpenAI client = OpenAI( base_url="https://whisper-large-v3.lepton.run/api/v1", api_key=os.environ.get('LEPTON_API_TOKEN') ) audio_file = open("output.mp3", "rb") transcript = client.audio.transcriptions.create( model="whisper-1", file=audio_file ) print(transcript)
The rate limit for the Serverless Endpoints is 10 requests per minute across all models under Basic Plan. If you need a higher rate limit with SLA please upgrade to standard plan, or use dedicated deployment.