Hi there,
I was wondering whether it makes sense to "trick" LASER into treating a whole document made up of multiple sentences as a single sentence. That way I'd get a whole-document embedding and wouldn't need to devise any aggregation method.
I know there's a limit of 12000 tokens on sentences (as per `parser.add_argument('--max-tokens', type=int, default=12000,`), but let's forget this for now please :)
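
To make the two options concrete, here is a minimal sketch of what I'm comparing. It assumes the encoder treats each input line as one sentence, so collapsing the document onto a single line is the "trick"; the alternative I'd like to avoid is something like mean pooling over per-sentence vectors. Both function names are made up for illustration and are not part of LASER, and the embedding step itself is left out.

```python
import re
from typing import List


def document_as_single_line(text: str) -> str:
    """Collapse every run of whitespace (newlines included) into one space,
    so the whole document becomes a single input line."""
    return re.sub(r"\s+", " ", text).strip()


def mean_pool(sentence_embeddings: List[List[float]]) -> List[float]:
    """Average per-sentence vectors component-wise into one document vector.
    This is the simplest aggregation baseline, not a LASER feature."""
    if not sentence_embeddings:
        raise ValueError("need at least one sentence embedding")
    n = len(sentence_embeddings)
    return [sum(component) / n for component in zip(*sentence_embeddings)]


# Hypothetical usage: the single line would then be passed to the encoder
# in place of the sentence-split text.
single_line = document_as_single_line("First sentence.\nSecond one.\n\n  Third.")
```

The first function only changes how the text is fed in; whether the resulting embedding is meaningful for long inputs is exactly what I'm asking about.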