我需要在我的程序中比较字符串,而不考虑特殊的国家字符,所以例如"C“和"Č”应该相同。我使用了排列器类。对于第一种和第二种情况,它的工作方式与预期相同,但在第三种和第四种情况下没有。
包排序器;
import java.text.Collator;
import java.util.Locale;
public class Coll {
public static void main(String[] args) {
Locale locale = new Locale("sk", "SK");
Collator collator = Collator.getInstance(locale);
collator.setStrength(Collator.PRIMARY);
System.out.println(collator.compare("T", "Ť"));
System.out.println(collator.compare("L", "Ľ"));
System.out.println(collator.compare("C", "Č"));
System.out.println(collator.compare("S", "Š"));
}
}我期望0 0 0,但实际输出是0 0 -1 -1
发布于 2019-01-24 01:13:54
查看java.text.Normalizer类。我从来没有广泛地使用过它,但它看起来对您的目的很有用。示例:
import java.text.Normalizer;
import java.text.Normalizer.Form;
public class NewClass2 {
public static void main(String[] args) {
System.out.println("T".compareTo( normalizeString("Ť") ));
System.out.println("L".compareTo( normalizeString("Ľ") ));
System.out.println("C".compareTo( normalizeString("Č") ));
System.out.println("S".compareTo( normalizeString("Š") ));
System.out.println("O".compareTo( normalizeString("Ö") ));
System.out.println("U".compareTo( normalizeString("Ü") ));
System.out.println("A".compareTo( normalizeString("Ä") ));
System.out.println("A".compareTo( normalizeString("Å") ));
}
public static String normalizeString(String str){
return Normalizer.normalize(str, Form.NFD).replaceAll("[^\\p{ASCII}]", "");
}
}有关更多信息,请阅读此博客文章:normalizing-text-in-java
https://stackoverflow.com/questions/54331615
复制相似问题