Configurable model options for a BERT model.

See (BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding) for more details.

Attributes: seq_len: Length of the sequence to feed into the model. do_fine_tuning: If true, then the BERT model is not frozen for training. dropout_rate: The rate for dropout.

seq_len Dataclass field
do_fine_tuning Dataclass field
dropout_rate Dataclass field



do_fine_tuning True
dropout_rate 0.1
seq_len 128