You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

95 lines
3.1 KiB

  1. <div align="center">
  2. <img src="https://raw.githubusercontent.com/doccano/doccano/master/docs/images/logo/doccano.png">
  3. </div>
  4. # doccano
  5. doccano is an open source text annotation tool for humans. It provides annotation features for text classification, sequence labeling and sequence to sequence tasks. So, you can create labeled data for sentiment analysis, named entity recognition, text summarization and so on. Just create a project, upload data and start annotating. You can build a dataset in hours.
  6. ## Demo
  7. You can try the [annotation demo](http://doccano.herokuapp.com).
  8. ![Demo image](https://raw.githubusercontent.com/doccano/doccano/master/docs/images/demo/demo.gif)
  9. ## Features
  10. - Collaborative annotation
  11. - Multi-language support
  12. - Mobile support
  13. - Emoji :smile: support
  14. - Dark theme
  15. - RESTful API
  16. ## Installation
  17. To install doccano, simply run:
  18. ```bash
  19. pip install doccano
  20. ```
  21. By default, SQLite 3 is used for the default database. If you want to use PostgreSQL, install the additional dependencies:
  22. ```bash
  23. pip install 'doccano[postgresql]'
  24. ```
  25. and set `DATABASE_URL` environment variable according to your PostgreSQL credentials:
  26. ```bash
  27. DATABASE_URL="postgres://${POSTGRES_USER}:${POSTGRES_PASSWORD}@${POSTGRES_HOST}:${POSTGRES_PORT}/${POSTGRES_DB}?sslmode=disable"
  28. ```
  29. After installation, run the following commands:
  30. ```bash
  31. # Initialize database.
  32. doccano init
  33. # Create a super user.
  34. doccano createuser --username admin --password pass
  35. # Start a web server.
  36. doccano webserver --port 8000
  37. ```
  38. In another terminal, run the following command:
  39. ```bash
  40. # Start the task queue to handle file upload/download.
  41. doccano task
  42. ```
  43. Go to <http://127.0.0.1:8000/>.
  44. ## FAQ
  45. - [How to create a user](https://doccano.github.io/doccano/faq/#how-to-create-a-user)
  46. - [How to add a user to your project](https://doccano.github.io/doccano/faq/#how-to-add-a-user-to-your-project)
  47. - [How to change the password](https://doccano.github.io/doccano/faq/#how-to-change-the-password)
  48. See the [documentation](https://doccano.github.io/doccano/) for details.
  49. ## Contribution
  50. As with any software, doccano is under continuous development. If you have requests for features, please file an issue describing your request. Also, if you want to see work towards a specific feature, feel free to contribute by working towards it. The standard procedure is to fork the repository, add a feature, fix a bug, then file a pull request that your changes are to be merged into the main repository and included in the next release.
  51. Here are some tips might be helpful. [How to Contribute to Doccano Project](https://github.com/doccano/doccano/wiki/How-to-Contribute-to-Doccano-Project)
  52. ## Citation
  53. ```tex
  54. @misc{doccano,
  55. title={{doccano}: Text Annotation Tool for Human},
  56. url={https://github.com/doccano/doccano},
  57. note={Software available from https://github.com/doccano/doccano},
  58. author={
  59. Hiroki Nakayama and
  60. Takahiro Kubo and
  61. Junya Kamura and
  62. Yasufumi Taniguchi and
  63. Xu Liang},
  64. year={2018},
  65. }
  66. ```
  67. ## Contact
  68. For help and feedback, please feel free to contact [the author](https://github.com/Hironsan).