Investigating the effect of corpus construction on latent dirichlet allocation based feature location