Skip to content
@USCDataScience

USC Information Retrieval & Data Science

USC Information Retrieval and Data Science Group

Pinned Loading

  1. sparkler sparkler Public

    Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

    Java 410 141

  2. SentimentAnalysisParser SentimentAnalysisParser Public

    Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.

    32 9

  3. autoextractor autoextractor Public

    Forked from thammegowda/autoextractor

    A toolkit for clustering web pages based on various similarity measures.

    Java 32 11

  4. polar.usc.edu polar.usc.edu Public

    Polar USC activities related to NSF Polar CyberInfrastructure program at the University of Southern California

    HTML 15 35

  5. supervising-ui supervising-ui Public

    Web UI for labelling dataset for supervised learning.

    Python 78 24

  6. dl4j-kerasimport-examples dl4j-kerasimport-examples Public

    This repository contains deeplearning4j examples for importing and making use of models trained in keras

    Java 27 28

Repositories

Showing 10 of 65 repositories
  • uscdatascience.github.io Public

    USC Information Retrieval and Data Science Group

    USCDataScience/uscdatascience.github.io’s past year of commit activity
    HTML 9 Apache-2.0 25 0 0 Updated Oct 4, 2024
  • tika-dockers Public

    A suite of Machine Learning / Deep Learning Dockerfiles to allow Apache Tika to extract objects and to produce textual captions for images and video

    USCDataScience/tika-dockers’s past year of commit activity
    21 Apache-2.0 5 1 1 Updated Jun 18, 2024
  • SentimentAnalysisParser Public

    Combines Apache OpenNLP and Apache Tika and provides facilities for automatically deriving sentiment from text.

    USCDataScience/SentimentAnalysisParser’s past year of commit activity
    32 Apache-2.0 9 2 1 Updated May 3, 2023
  • sparkler Public

    Spark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.

    USCDataScience/sparkler’s past year of commit activity
    Java 410 Apache-2.0 141 33 22 Updated Mar 30, 2023
  • polar.usc.edu Public

    Polar USC activities related to NSF Polar CyberInfrastructure program at the University of Southern California

    USCDataScience/polar.usc.edu’s past year of commit activity
    HTML 15 Apache-2.0 35 0 0 Updated Jan 15, 2023
  • polar-deep-insights Public

    Conceptual - Temporal - Spatial analysis of the trec polar dataset

    USCDataScience/polar-deep-insights’s past year of commit activity
    JavaScript 10 8 0 33 Updated Jan 4, 2023
  • NLTKRest Public

    This is a REST Server endpoint built using Flask and Python.

    USCDataScience/NLTKRest’s past year of commit activity
    Java 24 Apache-2.0 13 1 2 Updated Nov 16, 2022
  • AgePredictor Public

    Age classification from text using PAN16, blogs, Fisher Callhome, and Cancer Forum

    USCDataScience/AgePredictor’s past year of commit activity
    Java 17 Apache-2.0 12 6 6 Updated Jul 1, 2022
  • autoextractor Public Forked from thammegowda/autoextractor

    A toolkit for clustering web pages based on various similarity measures.

    USCDataScience/autoextractor’s past year of commit activity
    Java 32 Apache-2.0 13 3 0 Updated Oct 27, 2021
  • parser-indexer-py Public

    Python tools for parsing documents and building the inverted index with enriched metadata. Java version with slightly different features - https://github.com/USCDataScience/parser-indexer

    USCDataScience/parser-indexer-py’s past year of commit activity
    Jupyter Notebook 9 Apache-2.0 3 6 0 Updated Sep 2, 2021