Skip to content

Hu-chi/AI-literature-collector

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

Introduction

I make a tool for searching literature about some topic in my area. The search results are organized into notions to form a knowledge base.

I I have collected the titles and abstracts of the following conference papers for quick offline retrieval.

Common NLP Conferences:

Common AI Conferences:

Common CV Conferences:

Common Speech Conferences:

I organized the collected papers into the following format and saved them in a year*.json format file

{
	"url": "https://ojs.aaai.org/index.php/AAAI/article/view/8117", 
	"abstract": "With the exponential increase in the number of documents available online, e.g., news articles, weblogs, scientific documents, the development of effective and efficient classification methods is needed. The performance of document classifiers critically depends, among other things, on the choice of the feature representation. The commonly used \"bag of words\" and n-gram representations can result in prohibitively high dimensional input spaces. Data mining algorithms applied to these input spaces may be intractable due to the large number of dimensions. Thus, dimensionality reduction algorithms that can process data into features fast at runtime, ideally in  constant time per feature, are greatly needed in high throughput applications, where the number of features and data points can be in the order of millions. One promising line of research to dimensionality reduction is feature clustering. We propose to combine two types of feature clustering, namely hashing and abstraction based on hierarchical agglomerative clustering, in order to take advantage of the strengths of both techniques. Experimental results on two text data sets show that the combined approach uses significantly smaller number of features and gives similar performance when compared with the \"bag of words\" and n-gram approaches.", 
	"title": "Combining Hashing and Abstraction in Sparse High Dimensional Feature Spaces"
}

If the abstract is not collected, it will be stored as a null value.

Search

Use search.py to find the papers your need.

An example for finding NER papers is in the function main, change it as you want.

About

AI Paper Collector (Title + Abstract). (2012-Now)

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages