aboutsummaryrefslogtreecommitdiffstatshomepage
path: root/utils/word_conll_to_char_conll.py
Commit message (Collapse)AuthorAgeFilesLines
* refactor: Refactor TRPG NER model SDK: restructure codebase into ↵HsiangNianian2025-12-301-1/+2
| | | | base_model_trpgner package, implement training and inference modules, and add model download functionality. Remove legacy training and utils modules. Enhance documentation and examples for better usability.
* feat: Refactor and enhance TRPG NER model SDKHsiangNianian2025-12-301-0/+55
- Removed deprecated `word_conll_to_char_conll.py` utility and integrated its functionality into the new `utils` module. - Introduced a comprehensive GitHub Actions workflow for automated publishing to PyPI and GitHub Releases. - Added `__init__.py` files to establish package structure for `basemodel`, `inference`, `training`, and `utils` modules. - Implemented model downloading functionality in `download_model.py` to fetch pre-trained ONNX models. - Developed `TRPGParser` class for ONNX-based inference, including methods for parsing TRPG logs. - Created training utilities in `training/__init__.py` for NER model training with Hugging Face Transformers. - Enhanced utility functions for CoNLL file parsing and dataset creation. - Added command-line interface for converting CoNLL files to datasets with validation options.