Split English text into entities
Please download the pre-compiled binary from Latest Release. If you are using Windows, please download the zip file like
tbx-xx.x.xxx-win.zip. Then, extract the archive and place
tbx.exe on the Desktop folder.
The watermint toolbox can run from any path in the system if allowed by the system. But the instruction samples are using the Desktop folder. Please replace the path if you placed the binary other than the Desktop folder.
This document uses the Desktop folder for command example.
.\tbx.exe util text nlp english entity -in /LOCAL/PATH/TO/INPUT.txt
$HOME/Desktop/tbx util text nlp english entity -in /LOCAL/PATH/TO/INPUT.txt
Note for macOS Catalina 10.15 or above: macOS verifies Developer identity. Currently,
tbx is not ready for it. Please select “Cancel” on the first dialogue. Then please proceed “System Preference”, then open “Security & Privacy”, select “General” tab.
You may find the message like:
“tbx” was blocked from use because it is not from an identified developer.
And you may find the button “Allow Anyway”. Please hit the button with your risk. At second run, please hit button “Open” on the dialogue.
|Consider line break as regular white space while tokenizing
|Input file path
|Custom path to auth database (default: $HOME/.toolbox/secrets/secrets.db)
|Auto open URL or artifact folder
|Bandwidth limit in K bytes per sec for upload/download content. 0 for unlimited
|Memory budget (limits some feature to reduce memory footprint)
|Storage budget (limits logs or some feature to reduce storage usage)
|Maximum concurrency for running operation
|Number of processors
|Enable debug mode
|Enable experimental feature(s).
|Extra parameter file path
|Output format (none/text/markdown/json)
|HTTP/HTTPS proxy (hostname:port). Please specify
DIRECT if you want skip setting proxy.
|Suppress non-error messages, and make output readable by a machine (JSON format)
|Job data retain policy
|Do not store tokens into a file
|Skip logging in the local storage
|Show current operations for more detail.
English text file to split
The executable automatically detects your proxy configuration from the environment. However, if you got an error or you want to specify explicitly, please add -proxy option, like -proxy hostname:port. Currently, the executable doesn’t support proxies which require authentication.