Hi there,is this the correct approach to fulfil the rate limit restrictions in the foundational model API?from langchain_core.rate_limiters import InMemoryRateLimiter
rate_limiter = InMemoryRateLimiter(
requests_per_second=2.0,
check_every_...
Hi there,I am trying to parellize a text extraction via the Databrick foundational model.Any pointers to suggestions or examples are welcomeThe code and error below.model = "databricks-meta-llama-3-1-70b-instruct"
temperature=0.0
max_tokens=1024
sch...