
[INFERENCE API] Validate the max chunk size against the inference service and model. #133724

@davidkyle

Description

The max chunk size is limited by the model's context window and is therefore model dependent. Rather than enforcing a single max value for all services, the limit should be specific to the model or the service provider, similar to the logic in #132169.

Related to #133718, which removes the limitation.
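
A minimal sketch of what per-model validation could look like. The class name `ChunkSizeValidator`, the parameter names, and the idea of a model-supplied limit are assumptions for illustration; the actual wiring between the inference service, the model configuration, and the chunking settings is not shown.

```java
/**
 * Hypothetical sketch of per-model chunk size validation, not the actual
 * Elasticsearch Inference API implementation.
 */
public final class ChunkSizeValidator {

    private ChunkSizeValidator() {}

    /**
     * Validates the requested max chunk size against a model-specific limit,
     * which would be derived from the model's context window or provided by
     * the inference service.
     */
    public static void validateMaxChunkSize(int requestedMaxChunkSize, int modelMaxChunkSize, String modelId) {
        if (requestedMaxChunkSize <= 0) {
            throw new IllegalArgumentException("max_chunk_size must be a positive integer");
        }
        if (requestedMaxChunkSize > modelMaxChunkSize) {
            throw new IllegalArgumentException(
                "max_chunk_size ["
                    + requestedMaxChunkSize
                    + "] exceeds the maximum ["
                    + modelMaxChunkSize
                    + "] supported by model ["
                    + modelId
                    + "]"
            );
        }
    }
}
```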
