Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Removing faulty answers. #3

Open
taesiri opened this issue Oct 8, 2023 · 2 comments
Open

Removing faulty answers. #3

taesiri opened this issue Oct 8, 2023 · 2 comments

Comments

@taesiri
Copy link
Owner

taesiri commented Oct 8, 2023

It seems that either the Latex flattening code or the Claude-2.0 API (which one is unclear) is doing something strange, and some answers are generated without the body of the paper. For instance, for this question, the answer clearly states that it does not have access to the paper material. We should investigate this and find a filtering process to remove bad answers.

Without having access to the full paper, it's difficult to provide an accurate 1-sentence summary. However, academic papers often have an abstract at the beginning that summarizes the key points and contributions. The abstract would be a good starting point to understand the main idea of the paper in a concise way. If a 1-sentence summary is still needed, it would require looking at the introduction and conclusion sections to identify the overarching theme and outcomes of the research. Having access to key sections like these would help generate a very brief summary statement.

@taesiri
Copy link
Owner Author

taesiri commented Oct 8, 2023

A simple text search shows that following papers contain similar phrases:

# total of 43

['./papers/2301.13616.md',
 './papers/2308.16463.md',
 './papers/2212.08073.md',
 './papers/2309.0791.md',
 './papers/2107.13586.md',
 './papers/2305.19835.md',
 './papers/2306.17194.md',
 './papers/2306.04031.md',
 './papers/2304.04746.md',
 './papers/2305.15581.md',
 './papers/2310.03714.md',
 './papers/2305.07185.md',
 './papers/2207.05739.md',
 './papers/2302.04023.md',
 './papers/2303.17651.md',
 './papers/2307.01848.md',
 './papers/2303.04673.md',
 './papers/2307.04577.md',
 './papers/2307.10350.md',
 './papers/2306.17582.md',
 './papers/2308.11551.md',
 './papers/2305.09515.md',
 './papers/2205.11916.md',
 './papers/2309.08637.md',
 './papers/2309.15129.md',
 './papers/2305.10855.md',
 './papers/2309.03409.md',
 './papers/2304.10970.md',
 './papers/2201.07207.md',
 './papers/2305.14540.md',
 './papers/2308.01313.md',
 './papers/1803.11203.md',
 './papers/2309.16588.md',
 './papers/2308.13954.md',
 './papers/2306.09539.md',
 './papers/2309.16235.md',
 './papers/2305.16960.md',
 './papers/2307.16715.md',
 './papers/2306.09557.md',
 './papers/2306.05425.md',
 './papers/2107.03374.md',
 './papers/2210.09261.md',
 './papers/2304.13169.md']

@taesiri
Copy link
Owner Author

taesiri commented Oct 17, 2023

The above papers have been manually removed and re-submitted to the QA queue. Now, we have only four bad responses.

['./papers/2307.01848.md',
 './papers/2205.11916.md',
 './papers/2309.08637.md',
 './papers/2305.10855.md']

@github-staff github-staff deleted a comment May 12, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant