Text categorization for intellectual property: comparing balanced Winnow with SVM on different document representations