BUGZAP

Bugzilla Analysis Program is an analysis tool that uses some basic Natural Language Processing to extract descriptive information from bug reports.

Requirements

Bugzap uses Distiller: Automated Keyword Extraction from Document Collections

Usage

###Scraping Bugzilla

To start scraping bug reports for a Bugzilla instance, run:

$ scrapy crawl query.bugzilla -a query=queryfile -o outfile -t json

The query file needs the domain of the Bugzilla instance in its first line, plus at least one search query URL to start the scrape from:

mybugzillainstance.com
https://issues.mybugzillainstance.com/buglist.cgi?email1=myemail%40email.com&emailassigned_to1=1&emailcc1=1&emailreporter1=1&emailtype1=exact&list_id=2581552

Scrapy will store all retrieved bug reports to outfile in json format.

###Processing Documents

To process the retrieved bug reports, run:

$ python bugzap/main.py -j outfile -r bugzilla -b mylacklist -n mybugs

All reports will be saved to the bugzap/visualization/data/mybugs folder.

For further info, see: $ python buzap/main.py -h

###Viewing Statistics

The reports can be viewed from Mozilla by opening bugzap/visualization/freqDist.html file and entering the name of your data set:

$ firefox bugzap/visualization/freqDist.html

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
blacklists		blacklists
bugzap		bugzap
resources		resources
.gitignore		.gitignore
README.md		README.md
scrapy.cfg		scrapy.cfg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BUGZAP

Requirements

Usage

About

Releases

Packages

Languages

FranciscoCanas/bugzap

Folders and files

Latest commit

History

Repository files navigation

BUGZAP

Requirements

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages