Exploiting Domain Knowledge for Cross-domain Text Classification in Heterogeneous Data Sources