A schema conversion approach for constructing heterogeneous information networks from documents