manifest/README.md

# Manifest
How to make prompt programming with FMs a little easier.

## Install
Download the code:
```bash
git clone git@github.com:HazyResearch/manifest.git
cd manifest
```

Install:
```bash
pip install poetry
poetry install --no-dev
```

Dev Install:
```bash
pip install poetry
make dev
```

## Getting Started
Running is simple to get started. If using OpenAI, set `export OPENAI_API_KEY=<OPENAIKEY>` then run

```python
from manifest import Manifest

# Start a manifest session
manifest = Manifest(
    client_name = "openai",
)
manifest.run("Why is the grass green?")
```

We also support AI21, OPT models, and HuggingFace models (see [below](#huggingface-models)).

Caching by default is turned off, but to cache results, run

```python
from manifest import Manifest

# Start a manifest session
manifest = Manifest(
    client_name = "openai",
    cache_name = "sqlite",
    cache_connection = "mycache.sqlite",
)
manifest.run("Why is the grass green?")
```

We also support Redis backend.

## Manifest Components
Manifest is meant to be a very light weight package to help with prompt iteration. Three key design decisions are

* Prompt are functional -- they can take an input example and dynamically change
* All models are behind API calls (e.g., OpenAI)
* Everything can cached for reuse to both save credits and to explore past results

### Prompts
A Manifest prompt is a function that accepts a single input to generate a string prompt to send to a model.

```python
from manifest import Prompt
prompt = Prompt(lambda x: "Hello, my name is {x}")
print(prompt("Laurel"))
>>> "Hello, my name is Laurel"
```

We also let you use static strings
```python
prompt = Prompt("Hello, my name is static")
print(prompt())
>>> "Hello, my name is static"
```

### Sessions

Each Manifest run is a session that connects to a model endpoint and backend database to record prompt queries. To start a Manifest session for OpenAI, make sure you run
```bash
export OPENAI_API_KEY=<OPENAIKEY>
```
so we can access OpenAI.

Then run:
```python
from manifest import Manifest

manifest = Manifest(
    client_name = "openai",
    cache_name = "sqlite",
    cache_connection = "sqlite.cache"
)
```
This will start a session with OpenAI and save all results to a local file called `sqlite.cache`.

We also support a Redis backend. If you have a Redis database running on port 6379, run
```python
manifest = Manifest(
    client_name = "openai",
    cache_name = "redis",
    cache_connection = "localhost:6379"
)
```
As a hint, if you want to get Redis running, see the `docker run` command below under development.

We will explain [below](#huggingface-models) how to use Manifest for a locally hosted HuggingFace model.

Once you have a session open, you can write and develop prompts.

```python
prompt = Prompt(lambda x: "Hello, my name is {x}")
result = manifest.run(prompt, "Laurel")
```

You can also run over multiple examples.
```python
results = manifest.batch_run(prompt, ["Laurel", "Avanika"])
```

If something doesn't go right, you can also ask to get a raw manifest Response.
```python
result_object = manifest.batch_run(prompt, ["Laurel", "Avanika"], return_response=True)
print(result_object.get_request())
print(result_object.is_cached())
print(result_object.get_response())
```

By default, we do not truncate results based on a stop token. You can change this by either passing a new stop token to a Manifest session or to a `run` or `batch_run`. If you set the stop token to `""`, we will not truncate the model output.
```python
result = manifest.run(prompt, "Laurel", stop_token="and")
```

If you want to change default parameters to a model, we pass those as `kwargs` to the client.
```python
result = manifest.run(prompt, "Laurel", max_tokens=50)
```

### Huggingface Models
To use a HuggingFace generative model, in `manifest/api` we have a Falsk application that hosts the models for you.

In a separate terminal or Tmux/Screen session, run
```python
python3 manifest/api/app.py --model_type huggingface --model_name EleutherAI/gpt-j-6B --device 0
```
You will see the Flask session start and output a URL `http://127.0.0.1:5000`. Pass this in to Manifest. If you want to use a different port, set the `FLASK_PORT` environment variable.

```python
manifest = Manifest(
    client_name = "huggingface",
    client_connection = "http://127.0.0.1:5000",
)
```

If you have a custom model you trained, pass the model path to `--model_name`.

## Development
Before submitting a PR, run
```bash
export REDIS_PORT="6380"  # or whatever PORT local redis is running for those tests
cd <REDIS_PATH>
docker run -d -p 127.0.0.1:${REDIS_PORT}:6380 -v `pwd`:`pwd` -w `pwd` --name manifest_redis_test redis
make test
```

To use our development Redis database, email [Laurel](lorr1@cs.stanford.edu). If you have access to our GCP account, in a separate terminal, run
```bash
gcloud compute ssh "manifest-connect" --zone "europe-west4-a" --project "hai-gcp-head-models" -- -N -L 6379:10.152.93.107:6379
```

Then if you issue
```bash
redis-cli ping
```
You should see a `PONG` response from our database.
Fix readme and separate dev 2 years ago			`# Manifest`
			`How to make prompt programming with FMs a little easier.`
First main commit 2 years ago
Fix readme and separate dev 2 years ago			`## Install`
First main commit 2 years ago			`Download the code:`
[feature] better naming models in cache 2 years ago			```bash
First main commit 2 years ago			`git clone git@github.com:HazyResearch/manifest.git`
			`cd manifest`
			```

			`Install:`
[feature] better naming models in cache 2 years ago			```bash
First main commit 2 years ago			`pip install poetry`
Fix readme and separate dev 2 years ago			`poetry install --no-dev`
First main commit 2 years ago			```
Fix readme and separate dev 2 years ago
			`Dev Install:`
[feature] better naming models in cache 2 years ago			```bash
First main commit 2 years ago			`pip install poetry`
			`make dev`
remove circular imports in client, manifest; add prompt serialization and dill dep 2 years ago			```
Fix readme and separate dev 2 years ago
			`## Getting Started`
			Running is simple to get started. If using OpenAI, set `export OPENAI_API_KEY=<OPENAIKEY>` then run

			```python
			`from manifest import Manifest`

			`# Start a manifest session`
			`manifest = Manifest(`
			`client_name = "openai",`
			`)`
			`manifest.run("Why is the grass green?")`
			```

			`We also support AI21, OPT models, and HuggingFace models (see [below](#huggingface-models)).`

			`Caching by default is turned off, but to cache results, run`

			```python
			`from manifest import Manifest`

			`# Start a manifest session`
			`manifest = Manifest(`
			`client_name = "openai",`
			`cache_name = "sqlite",`
			`cache_connection = "mycache.sqlite",`
			`)`
			`manifest.run("Why is the grass green?")`
			```

			`We also support Redis backend.`

			`## Manifest Components`
			`Manifest is meant to be a very light weight package to help with prompt iteration. Three key design decisions are`
[feature] redis DB, flask API, tests 2 years ago
			`* Prompt are functional -- they can take an input example and dynamically change`
			`* All models are behind API calls (e.g., OpenAI)`
Fix readme and separate dev 2 years ago			`* Everything can cached for reuse to both save credits and to explore past results`
[feature] redis DB, flask API, tests 2 years ago
Fix readme and separate dev 2 years ago			`### Prompts`
[feature] redis DB, flask API, tests 2 years ago			`A Manifest prompt is a function that accepts a single input to generate a string prompt to send to a model.`
Fix readme and separate dev 2 years ago
[feature] better naming models in cache 2 years ago			```python
[feature] redis DB, flask API, tests 2 years ago			`from manifest import Prompt`
			`prompt = Prompt(lambda x: "Hello, my name is {x}")`
[feature] added opt; better kwargs for errors 2 years ago			`print(prompt("Laurel"))`
[feature] redis DB, flask API, tests 2 years ago			`>>> "Hello, my name is Laurel"`
			```
Fix readme and separate dev 2 years ago
[feature] redis DB, flask API, tests 2 years ago			`We also let you use static strings`
[feature] better naming models in cache 2 years ago			```python
[feature] redis DB, flask API, tests 2 years ago			`prompt = Prompt("Hello, my name is static")`
			`print(prompt())`
			`>>> "Hello, my name is static"`
			```

Fix readme and separate dev 2 years ago			`### Sessions`
[feature] redis DB, flask API, tests 2 years ago
			`Each Manifest run is a session that connects to a model endpoint and backend database to record prompt queries. To start a Manifest session for OpenAI, make sure you run`
[feature] better naming models in cache 2 years ago			```bash
[feature] redis DB, flask API, tests 2 years ago			`export OPENAI_API_KEY=<OPENAIKEY>`
			```
			`so we can access OpenAI.`

Fix readme and separate dev 2 years ago			`Then run:`
[feature] better naming models in cache 2 years ago			```python
[feature] redis DB, flask API, tests 2 years ago			`from manifest import Manifest`

			`manifest = Manifest(`
			`client_name = "openai",`
			`cache_name = "sqlite",`
			`cache_connection = "sqlite.cache"`
			`)`
			```
			This will start a session with OpenAI and save all results to a local file called `sqlite.cache`.

			`We also support a Redis backend. If you have a Redis database running on port 6379, run`
[feature] better naming models in cache 2 years ago			```python
[feature] redis DB, flask API, tests 2 years ago			`manifest = Manifest(`
			`client_name = "openai",`
			`cache_name = "redis",`
			`cache_connection = "localhost:6379"`
			`)`
			```
			As a hint, if you want to get Redis running, see the `docker run` command below under development.

			`We will explain [below](#huggingface-models) how to use Manifest for a locally hosted HuggingFace model.`

			`Once you have a session open, you can write and develop prompts.`

[feature] better naming models in cache 2 years ago			```python
[feature] redis DB, flask API, tests 2 years ago			`prompt = Prompt(lambda x: "Hello, my name is {x}")`
			`result = manifest.run(prompt, "Laurel")`
			```

			`You can also run over multiple examples.`
[feature] better naming models in cache 2 years ago			```python
[feature] redis DB, flask API, tests 2 years ago			`results = manifest.batch_run(prompt, ["Laurel", "Avanika"])`
			```

			`If something doesn't go right, you can also ask to get a raw manifest Response.`
[feature] better naming models in cache 2 years ago			```python
[feature] redis DB, flask API, tests 2 years ago			`result_object = manifest.batch_run(prompt, ["Laurel", "Avanika"], return_response=True)`
			`print(result_object.get_request())`
			`print(result_object.is_cached())`
			`print(result_object.get_response())`
			```

			By default, we do not truncate results based on a stop token. You can change this by either passing a new stop token to a Manifest session or to a `run` or `batch_run`. If you set the stop token to `""`, we will not truncate the model output.
[feature] better naming models in cache 2 years ago			```python
[feature] redis DB, flask API, tests 2 years ago			`result = manifest.run(prompt, "Laurel", stop_token="and")`
			```

			If you want to change default parameters to a model, we pass those as `kwargs` to the client.
[feature] better naming models in cache 2 years ago			```python
[feature] redis DB, flask API, tests 2 years ago			`result = manifest.run(prompt, "Laurel", max_tokens=50)`
			```
Fix readme and separate dev 2 years ago
			`### Huggingface Models`
[feature] redis DB, flask API, tests 2 years ago			To use a HuggingFace generative model, in `manifest/api` we have a Falsk application that hosts the models for you.

			`In a separate terminal or Tmux/Screen session, run`
[feature] better naming models in cache 2 years ago			```python
[feature] redis DB, flask API, tests 2 years ago			`python3 manifest/api/app.py --model_type huggingface --model_name EleutherAI/gpt-j-6B --device 0`
			```
			You will see the Flask session start and output a URL `http://127.0.0.1:5000`. Pass this in to Manifest. If you want to use a different port, set the `FLASK_PORT` environment variable.

[feature] better naming models in cache 2 years ago			```python
[feature] redis DB, flask API, tests 2 years ago			`manifest = Manifest(`
			`client_name = "huggingface",`
			`client_connection = "http://127.0.0.1:5000",`
			`)`
			```

[feature] better naming models in cache 2 years ago			If you have a custom model you trained, pass the model path to `--model_name`.

Fix readme and separate dev 2 years ago			`## Development`
remove circular imports in client, manifest; add prompt serialization and dill dep 2 years ago			`Before submitting a PR, run`
[feature] better naming models in cache 2 years ago			```bash
[feature] redis DB, flask API, tests 2 years ago			`export REDIS_PORT="6380" # or whatever PORT local redis is running for those tests`
			`cd <REDIS_PATH>`
			docker run -d -p 127.0.0.1:${REDIS_PORT}:6380 -v `pwd`:`pwd` -w `pwd` --name manifest_redis_test redis
remove circular imports in client, manifest; add prompt serialization and dill dep 2 years ago			`make test`
			```
[feature] redis DB, flask API, tests 2 years ago
			`To use our development Redis database, email [Laurel](lorr1@cs.stanford.edu). If you have access to our GCP account, in a separate terminal, run`
[feature] better naming models in cache 2 years ago			```bash
[feature] redis DB, flask API, tests 2 years ago			`gcloud compute ssh "manifest-connect" --zone "europe-west4-a" --project "hai-gcp-head-models" -- -N -L 6379:10.152.93.107:6379`
			```

			`Then if you issue`
[feature] better naming models in cache 2 years ago			```bash
[feature] redis DB, flask API, tests 2 years ago			`redis-cli ping`
			```
			You should see a `PONG` response from our database.