how embedding models that are not word based (like
GysBert) can be made to be sentence or chunk based? Conceptually it is quite basic machine learning layers which are doing the job, however, what does that mean, why they capture the semantics, have to check.