You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm encountering this as well. While I haven't been able to figure out the root cause yet, I learned that when you create the query list, the keys (ie 'base salary', 'vacation') are used to reduce the amount of content to only chunks of content containing those terms. So if you're using a label there that won't be found in the document, it won't return any chunks of text.
When I updated the query list to something that was on the first page I then got some results. But now it seems to only be parsing the first page - or 10 chunks. So I'm not sure if it's limiting itself to 10 chunks or is encountering an issue with pages in the document.
Their example contracts have multiple pages so I'm not quite sure what the issue is.
Hey! I ran into that same issue with "prompter.add_source_document()" giving a warning that the source wasn't added. After looking into it, I think the main reason is that the query key we're using (like 'base salary' or 'vacation') has to actually exist exactly in the document. If it doesn’t match any content, no chunks get picked up and it doesn’t add the source.
To check, try using a super common word like “employee” or even set query=None just to see if the document gets loaded without filtering. That’ll tell us if the file itself is fine.
Also, the file type might matter — make sure it’s a .pdf or .txt. And spaces in the filename probably aren’t the issue, but to be safe, try renaming one file to something simple like contract1.pdf.
Lastly, it looks like it only pulls in up to 10 chunks, so if our key is on a later page or not found early, it might get skipped.
Uh oh!
There was an error while loading. Please reload this page.
I followed the following LLMWare tutorial on YouTube and ran into the following error when using
Prompter.add_source_document()
.source = prompter.add_source_document(contracts_path, contract, query=key)
This gives me a Warning that the source was not added.
What could be the issue?
My
contracts_path = ./docs/
. My files do have spaces in their names, but I doubt that would be an issue. Could it be the file-type suffix?Tutorial Video Followed: https://www.youtube.com/watch?v=8aV5p3tErP0
The text was updated successfully, but these errors were encountered: