Can the other LLMs keep up?
Remove the rare variants, here singletons and doubletons by setting AC threshold with 'bcftools view'. Split multiallelic sites to biallelic records with 'bcftools norm'. Keep only SNPs and INDELs ...