This commit updates some of the JSON snippets in the README.md file and
removes the `json` language tag from their code blocks.
The motivation for this change is that invalid JSON in a code snippet
is highlighted in red, which makes the snippet harder to read and can
be distracting.
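As a rough illustration of why such snippets count as invalid JSON, the sketch below feeds a README-style example to Python's strict `json` parser. The snippet text is a hypothetical, completed version of one of the examples touched by this commit, and `is_valid_json` is a helper name introduced here only for illustration. Which constructs a given Markdown renderer flags in red is an assumption; the sketch only shows that a strict parser rejects them.

```python
import json

# Hypothetical README-style snippet: the trailing `//` comment helps the
# reader but is not part of the JSON grammar.
snippet = """
{
  "tokens": [
    {"id": 198, "piece": [195]} // hex C3
  ]
}
"""


def is_valid_json(text: str) -> bool:
    """Return True if `text` parses as strict JSON, False otherwise."""
    try:
        json.loads(text)
    except json.JSONDecodeError:
        return False
    return True


print(is_valid_json(snippet))         # the comment makes parsing fail
print(is_valid_json('{"index": 0}'))  # plain JSON parses without error
```

Placeholders such as `[ generated token ids if requested ]` fail the same check, which is why dropping the language tag is a simpler fix than rewriting every example as strict JSON.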
- Note: In streaming mode (`stream`), only `content`, `tokens` and `stop` will be returned until end of completion. Responses are sent using the [Server-sent events](https://html.spec.whatwg.org/multipage/server-sent-events.html) standard. Note: the browser's `EventSource` interface cannot be used due to its lack of `POST` request support.
- `completion_probabilities`: An array of token probabilities for each completion. The array's length is `n_predict`. Each item in the array has a nested array `top_logprobs`. It contains at **maximum** `n_probs` elements:
- ```json
+ ```
{
"content": "<the generated completion text>",
"tokens": [ generated token ids if requested ],
```
With input 'á' (utf8 hex: C3 A1) on tinyllama/stories260k
-```json
+```
{
"tokens": [
{"id": 198, "piece": [195]}, // hex C3
**Response format**
-```json
+```
[
{
"index": 0,