You will also need an embeddings model and a large language model. We recommend using @react-native-rag/executorch for on-device inference. To use it, install the ...
PYTHON_VERSION_TAG=310 && gh release download \ --repo arm/pte-adapter-model-explorer \ --pattern "*py${PYTHON_VERSION_TAG}*.whl" && pip install *py${PYTHON_VERSION ...