<h1 align="center">
FileGPT 🤖
</h1>

Read the article to learn how it works: <a href="">Medium Article</a>

FileGPT extracts the text of a file, computes an embedding for each text segment, and lets you ask questions about the file through a chat interface.

All code was written with the help of <a href="https://codegpt.co">Code GPT</a>

<a href="https://codegpt.co" target="_blank"><img width="753" alt="Screenshot 2023-02-08 at 9:16:43 p.m." src="https://user-images.githubusercontent.com/6216945/217699939-eca3ae47-c488-44da-9cf6-c7caef69e1a7.png"></a>

<hr>
<br>

# Features

- Read any PDF, DOCX, TXT or CSV file
- Embed text segments with LangChain and OpenAI (**text-embedding-ada-002**)
- Chat with the file using **streamlit-chat** and LangChain QA with sources (**text-davinci-003**)
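
The idea behind the embedding and chat features can be sketched in a few lines: split the text into segments, turn each segment into a vector, and pick the segments closest to the question before handing them to the language model. The sketch below is not the app's code. It assumes a simple word-count vector as a stand-in for **text-embedding-ada-002** and a plain sorted list as a stand-in for a vector index, so that it runs without an API key. All function names are hypothetical.

```python
import math
import re
from collections import Counter


def split_into_segments(text, max_words=50):
    """Split text into consecutive segments of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def toy_embedding(text):
    """Stand-in for a real embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_relevant_segments(question, segments, top_k=1):
    """Return the top_k segments most similar to the question."""
    query_vector = toy_embedding(question)
    ranked = sorted(
        segments,
        key=lambda segment: cosine_similarity(query_vector, toy_embedding(segment)),
        reverse=True,
    )
    return ranked[:top_k]


if __name__ == "__main__":
    # Hypothetical file contents, used only for illustration.
    segments = [
        "Paris is the capital of France.",
        "Bananas are yellow.",
    ]
    print(most_relevant_segments("What is the capital of France?", segments))
```

In a real pipeline the retrieved segments would be passed to the language model as context along with the question. Word counts only match exact terms, which is why a learned embedding model is used in practice.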

# Running Locally

1. Clone the repository

```bash
git clone https://github.com/davila7/file-gpt
cd file-gpt
```
2. Install dependencies

The `requirements.txt` file lists the following dependencies:

* openai
* pypdf
* scikit-learn
* numpy
* tiktoken
* docx2txt
* langchain
* pydantic
* typing
* faiss-cpu
* streamlit_chat

```bash
pip install -r requirements.txt
```
3. Run the Streamlit server

```bash
streamlit run app.py
```
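
If the server stops with an import error, a quick check like the one below can show which dependencies are missing from the active environment. This is a minimal sketch and not part of the repository. The import names are assumptions based on the dependency list above (for example, `scikit-learn` is imported as `sklearn` and `faiss-cpu` as `faiss`), and `missing_modules` is a hypothetical helper.

```python
from importlib.util import find_spec

# Assumed import names for the packages listed in requirements.txt.
REQUIRED_MODULES = [
    "openai",
    "pypdf",
    "sklearn",         # installed as scikit-learn
    "numpy",
    "tiktoken",
    "docx2txt",
    "langchain",
    "pydantic",
    "faiss",           # installed as faiss-cpu
    "streamlit_chat",
]


def missing_modules(module_names):
    """Return the top-level module names that cannot be found."""
    return [name for name in module_names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

Any module reported as missing can be installed by running `pip install -r requirements.txt` again in the same environment.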