Abstractive Summarization

Automatic abstractive summarization for news articles.

How to run

Download repository.
Install project:
1. > cd /path/to/abstractive-summarization
2. > ./setup [path/to/target/directory]
  1. Project will be moved to the target directory. If no target directory is specified, project is installed in working directory.
  2. Stanford JARs will be downloaded into lib directory.
Run demo: > ./demo [arguments]
- If no arguments are specified, an excerpt on Tolstoy's biography (./resources/article-tolstoy.txt) will be summarized for the demo.
- If any arguments are specified for demo, the default text file will be ignored.

-h or --help .......... Help

-f or --file [filename] Path to file, containing body of text to be summarized

-m .................... Write metadata to file

-s .................... Write summary to file

Program reads in file.
Extracts important semantic information and writes it to file.
1. Extracts semantic triples
  - Example:"Bob likes puppies more than cats."
    System extracts multiple triples
    - [Bob | likes | puppies]
    - [Bob | likes | puppies more than cats]
2. Extracts named entities: Bob -> Person
Removes semantic information with low confidence scores.
Removes other problematic extracted triples based on a series of rules.
Removes sentences that were not assigned triples or had all of their triples removed.
Generates new sentences off of the remaining information.
Adds back in the time named entity information.
Performs formatting.
Displays summary.

This program still needs work, but the system does summarize a body of text.
Text files are provided inside resources.
By default, the summarized text is sent to standard out.
The meta-data (i.e., named entity information and triples) is written to a file: originalfilename-meta.txt.
The summary is written to a file: originalfilename-summary.txt.

Name		Name	Last commit message	Last commit date
Latest commit History 174 Commits
lib		lib
resources		resources
.gitignore		.gitignore
Concatenator.java		Concatenator.java
EntitiesList.java		EntitiesList.java
Entity.java		Entity.java
Extractor.java		Extractor.java
Formatter.java		Formatter.java
Fyles.java		Fyles.java
Manager.java		Manager.java
Network.java		Network.java
README.md		README.md
Sentence.java		Sentence.java
Times.java		Times.java
Triple.java		Triple.java
demo		demo
setup		setup