sahutd/ir_source_reuse_detection

ir_source_reuse_detection

An approach to identifying similar Java classes in a dataset. A representational query is created for each document, taking language constructs and structure into account. Finding similarity is then reduced to finding the probability that a document was generated by a query.
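To make the probabilistic idea concrete, here is a minimal sketch of a query-likelihood score in Java. It is not taken from this project's code: the class name `QueryLikelihoodSketch`, the regex tokenizer, the unigram model, and the add-one smoothing are all assumptions chosen for illustration. The project itself also considers language constructs and structure, which this sketch ignores.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration only; not the repository's implementation.
public class QueryLikelihoodSketch {

    // Assumed tokenizer: keeps identifiers and keywords, drops punctuation and literals.
    private static final Pattern TOKEN = Pattern.compile("[A-Za-z_][A-Za-z0-9_]*");

    static List<String> tokenize(String source) {
        List<String> tokens = new ArrayList<>();
        Matcher matcher = TOKEN.matcher(source);
        while (matcher.find()) {
            tokens.add(matcher.group().toLowerCase());
        }
        return tokens;
    }

    // Log-probability of the query terms under a unigram model of the document,
    // with add-one smoothing so unseen terms do not zero out the score.
    static double logLikelihood(List<String> query, List<String> document, int vocabularySize) {
        Map<String, Integer> counts = new HashMap<>();
        for (String term : document) {
            counts.merge(term, 1, Integer::sum);
        }
        double denominator = document.size() + vocabularySize;
        double logP = 0.0;
        for (String term : query) {
            logP += Math.log((counts.getOrDefault(term, 0) + 1) / denominator);
        }
        return logP;
    }

    public static void main(String[] args) {
        List<String> query = tokenize("int total = a + b;");
        List<String> docA = tokenize("int total = x + y; return total;");
        List<String> docB = tokenize("String name = null;");

        // Vocabulary: distinct terms across the query and both documents.
        Set<String> vocabulary = new HashSet<>(query);
        vocabulary.addAll(docA);
        vocabulary.addAll(docB);

        System.out.println("docA: " + logLikelihood(query, docA, vocabulary.size()));
        System.out.println("docB: " + logLikelihood(query, docB, vocabulary.size()));
    }
}
```

A higher (less negative) score means the document is a more likely match for the query. A real system would likely use a stronger smoothing scheme and structure-aware tokens.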

Running

Import the project into your favorite IDE, such as Eclipse or IntelliJ (the required libraries will be downloaded).

Add your test Java classes to the src/queryset folder.

Run the program and, in the GUI, select the classes you want to compare against.

Sample GUI screenshot
