diff options
| author | 2025-12-30 19:14:39 +0800 | |
|---|---|---|
| committer | 2025-12-30 19:14:39 +0800 | |
| commit | 7ac684f1f82023c6284cd7d7efde11b8dc98c149 (patch) | |
| tree | 4ac4e9fb72a4e1e2578d9fb4e9704967b052ec15 /.gitignore | |
| parent | 12910f3a937633a25aa0de463a6edf756f2b8cdd (diff) | |
| download | base-model-7ac684f1f82023c6284cd7d7efde11b8dc98c149.tar.gz base-model-7ac684f1f82023c6284cd7d7efde11b8dc98c149.zip | |
feat: Implement TRPG NER training and inference script with robust model path detection and enhanced timestamp/speaker handling
- Added main training and inference logic in main.py, including CoNLL parsing, tokenization, and model training.
- Introduced TRPGParser class for inference with entity aggregation and special handling for timestamps and speakers.
- Developed utility functions for converting word-level CoNLL to char-level and saving datasets in various formats.
- Added ONNX export functionality for the trained model.
- Created a comprehensive requirements.txt and updated pyproject.toml with necessary dependencies.
- Implemented tests for ONNX inference to validate model outputs.
Diffstat (limited to '.gitignore')
| -rw-r--r-- | .gitignore | 6 |
1 files changed, 5 insertions, 1 deletions
@@ -161,4 +161,8 @@ cython_debug/ # uv .python-version -uv.lock
\ No newline at end of file +uv.lock + +# model +models/ +dataset/
\ No newline at end of file |
