Estimates a rough token budget from a text's UTF-8 byte length, using a chars-per-token heuristic, before the text is sent to a language model.
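A minimal sketch of the heuristic, assuming the estimate is the UTF-8 byte length divided by a configurable ratio and rounded up. The names `estimate_tokens` and `fits_budget` and the default ratio of 4.0 are illustrative assumptions, not taken from the source; the real ratio depends on the tokenizer and the language of the text.

```python
import math


def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Return a rough token estimate from the UTF-8 byte length of `text`.

    The default ratio of 4.0 is an assumed rule of thumb, not a measured value.
    """
    if chars_per_token <= 0:
        raise ValueError("chars_per_token must be positive")
    # Count bytes rather than code points, so multi-byte characters weigh more.
    byte_len = len(text.encode("utf-8"))
    # Round up so the estimate errs toward using more of the budget.
    return math.ceil(byte_len / chars_per_token)


def fits_budget(text: str, max_tokens: int, chars_per_token: float = 4.0) -> bool:
    """Check the estimate against a token limit before calling a model."""
    return estimate_tokens(text, chars_per_token) <= max_tokens


if __name__ == "__main__":
    prompt = "hello world"
    print(estimate_tokens(prompt))
    print(fits_budget(prompt, max_tokens=3))
```

Because the input is measured in bytes, non-ASCII text produces a higher estimate than a plain character count would. Rounding up keeps the check conservative, but the result is still only an approximation and can differ noticeably from what a real tokenizer reports.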