Cee*_*jes 9 java arraylist longest-substring
我想比较两个ArrayLists并返回Java中最大的相似性子集.所以我想比较列表的部分而不仅仅是单个值.
例:
list 1 list 2
F A
A B
B C
C F
D D
Z Z
A
F
C
Run Code Online (Sandbox Code Playgroud)
最大的子集:
Arraylist: [A,B,C]
Run Code Online (Sandbox Code Playgroud)
第二个最大的子集应该是:
ArrayList: [D,Z]
Run Code Online (Sandbox Code Playgroud)
我怎样才能有效地做到这一点?(不使用超过2个for循环)
retainAll()不起作用,retainAll()返回相等的值,而不是最大的子集.
编辑 我想作为输出,在最大子集之前列出,最大子集,在最大子集之后列出.通过示例输出应该是:
[[F],[null]],[A,B,C],[[D,Z,A,F,C],[F,D,Z]]
Run Code Online (Sandbox Code Playgroud)
假设两个List都有String元素,你可以使用:
List<List<String>> beforeList = new ArrayList<>();
List<List<String>> afterList = new ArrayList<>();
List<String> commonSubsetList = new ArrayList<>();
for (int i = 0; i < list1.size(); i++) {
int k = i;
List<String> tmpList = new ArrayList<>();
List<String> tmpBeforeList1 = list1.subList(0, i); // container for before elements from list1
List<String> tmpAfterList1 = new ArrayList<>(); // container for after elements from list1
List<String> tmpBeforeList2 = new ArrayList<>(); // container for before elements from list2
List<String> tmpAfterList2 = new ArrayList<>(); // container for after elements from list2
for (int j = 0; j < list2.size();) {
if(k < list1.size() && list1.get(k).equals(list2.get(j))) {
// when common element is found, increment both counters and add element to tmp list
tmpList.add(list2.get(j));
k++;
j++;
} else {
if(tmpList.size() > 0) {
tmpAfterList1 = list1.subList(k, list1.size());
tmpAfterList2 = list2.subList(j, list2.size());
break;
} else {
tmpBeforeList2.add(list2.get(j));
}
j++;
}
}
if(commonSubsetList.size() <= tmpList.size()) {
// reset beforeList and afterList before adding new list
beforeList.clear();
afterList.clear();
// add new lists
beforeList.add(tmpBeforeList1);
beforeList.add(tmpBeforeList2);
afterList.add(tmpAfterList1);
afterList.add(tmpAfterList2);
commonSubsetList = new ArrayList<>(tmpList);
}
}
System.out.println(beforeList + ", " + commonSubsetList + ", " + afterList);
Run Code Online (Sandbox Code Playgroud)
这也包括列表之前和之后.希望这是你想要的.
您必须考虑列表中所有可能的项目对。当一对匹配时,然后尝试从这些索引开始构建子集。如果该子集大于当前候选集,则该子集将替换当前候选集。
一种优化是当子集大于较小列表长度的一半时退出。
您可以修改下面的示例以收集所有子集及其索引信息。
例子:
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
public class Main {
/**
* Holds information about a sub set
*
* @param <T> type of subset items
*/
private static class SubSet<T> {
List<T> items; // items of subset
int startIndex1; // start index in list 1
int endIndex1; // end index in list 1
int startIndex2; // start index in list 2
int endIndex2; // end index in list 2
}
/**
* Run main example.
*
* @param args arguments - none honored.
* @throws java.lang.Exception - in case of any error.
*/
public static void main(String[] args) throws java.lang.Exception {
// define 2 lists
List<Integer> list1 = Arrays.asList(1, 2, 3, 4, 5, 6, 3, 2, 5, 6, 7, 3, 8);
List<Integer> list2 = Arrays.asList(2, 8, 7, 2, 3, 4, 5, 3, 2, 5, 1, 5);
// print the lists
System.out.println("First list: " + Arrays.toString(list1.toArray()));
System.out.println("Second list: " + Arrays.toString(list2.toArray()));
// get largest sub set
SubSet<Integer> largest = getLargestSubSet(list1, list2);
if (largest == null) {
// nothing found
System.out.println("No subset found.");
} else {
// print info about subset
System.out.println("Largest subset: " + Arrays.toString(largest.items.toArray()));
if (largest.startIndex1 > 0) {
List<Integer> beforeList1 = list1.subList(0, largest.startIndex1);
System.out.println("Items before largest subset in first list: "
+ Arrays.toString(beforeList1.toArray()));
}
if (largest.endIndex1 < list1.size() - 1) {
List<Integer> afterList1 = list1.subList(largest.endIndex1 + 1, list1.size());
System.out.println("Items after largest subset in first list: "
+ Arrays.toString(afterList1.toArray()));
}
if (largest.startIndex2 > 0) {
List<Integer> beforeList2 = list2.subList(0, largest.startIndex2);
System.out.println("Items before largest subset in second list: "
+ Arrays.toString(beforeList2.toArray()));
}
if (largest.endIndex2 < list2.size() - 1) {
List<Integer> afterList2 = list2.subList(largest.endIndex2 + 1, list2.size());
System.out.println("Items after largest subset in second list: "
+ Arrays.toString(afterList2.toArray()));
}
}
}
/**
* Equality check for items.
*
* @param obj1 first item.
* @param obj2 second item.
* @param <T> item type.
* @return true if equal,false if not.
*/
private static <T> boolean areEqual(T obj1, T obj2) {
return obj1 == obj2; // naive comparison
}
/**
* Get largest subset (first occurrence) for given lists.
*
* @param list1 first list.
* @param list2 second list.
* @param <T> list item type.
* @return Largest sub sequence list, or empty list.
*/
private static <T> SubSet<T> getLargestSubSet(List<T> list1, List<T> list2) {
SubSet<T> output = null;
for (int i = 0; i < list1.size(); i++) {
for (int j = 0; j < list2.size(); j++) {
// optimisation : exit early
if (output != null && output.items.size() > Math.min(list1.size(), list2.size())) {
return output;
}
if (areEqual(list1.get(i), list2.get(j))) {
// inspect sub sequence from this (i,j) onwards
output = inspectSubSet(list1, list2, i, j, output);
}
}
}
return output;
}
/**
* For given starting indices, inspect if there is a larger subset, than given one.
*
* @param list1 first list.
* @param list2 second list.
* @param index1 first index.
* @param index2 second index.
* @param oldSubSet existing largest subset, for comparison.
* @param <T> list item type.
* @return larger subset, if found, else existing one is returned as is.
*/
private static <T> SubSet<T> inspectSubSet(List<T> list1, List<T> list2,
int index1, int index2, SubSet<T> oldSubSet) {
// new subset candidate
SubSet<T> newSubSet = new SubSet<T>();
newSubSet.items = new ArrayList<T>();
newSubSet.startIndex1 = index1;
newSubSet.endIndex1 = index1;
newSubSet.startIndex2 = index2;
newSubSet.endIndex2 = index2;
// keep building subset as subsequent items keep matching
do {
newSubSet.items.add(list1.get(index1));
newSubSet.endIndex1 = index1;
newSubSet.endIndex2 = index2;
index1++;
index2++;
} while (index1 < list1.size() && index2 < list2.size()
&& areEqual(list1.get(index1), list2.get(index2)));
// return first, larger or same.
if (oldSubSet == null) {
return newSubSet;
} else if (newSubSet.items.size() > oldSubSet.items.size()) {
return newSubSet;
} else {
return oldSubSet;
}
}
}
Run Code Online (Sandbox Code Playgroud)
输出:
First list: [1, 2, 3, 4, 5, 6, 3, 2, 5, 6, 7, 3, 8]
Second list: [2, 8, 7, 2, 3, 4, 5, 3, 2, 5, 1, 5]
Largest subset: [2, 3, 4, 5]
Items before largest subset in first list: [1]
Items after largest subset in first list: [6, 3, 2, 5, 6, 7, 3, 8]
Items before largest subset in second list: [2, 8, 7]
Items after largest subset in second list: [3, 2, 5, 1, 5]
Run Code Online (Sandbox Code Playgroud)
| 归档时间: |
|
| 查看次数: |
1271 次 |
| 最近记录: |