You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Hironsan 9a9d765bbb Update README 6 years ago
data Add project template 6 years ago
doccano Add demo image 6 years ago
docs Update README 6 years ago
tests Update testdata 6 years ago
.gitignore Add project template 6 years ago
ISSUE_TEMPLATE.md Add project template 6 years ago
LICENSE Initial commit 6 years ago
README.md Update README 6 years ago
requirements.txt Implement classifier 6 years ago
tox.ini Add project template 6 years ago

README.md

doccano

doccano is a document annotation tool. The purpose is making annotation process efficient. First, manually labeling small data in minutes using the labeling interface. Second, train built-in classification model using the labeled data and classify unlabeled data with their probability. Then, sort data in ascending order by the probability. You can efficiently annotate the data.

doccano

Features

  • Active Learning based annotation

Requirements

  • Python3.6+
  • numpy 1.14.3+
  • scikit-learn 0.19.1+
  • scipy 1.1.0+

Put data into doccano/data directory.

Installation

To install doccano, simply run:

$ git clone https://github.com/chakki-works/doccano.git
$ cd doccano
$ pip install -r requirements.txt

Usage

First, run web application:

$ cd doccano/server
$ python run_server.py

Then, open http://localhost:8080 in your browser.