Skip to content

Similar String Groups⚓︎

Link

Description⚓︎

Two strings, X and Y, are considered similar if either they are identical or we can make them equivalent by swapping at most two letters (in distinct positions) within the string X.

For example, "tars" and "rats" are similar (swapping at positions 0 and 2), and "rats" and "arts" are similar, but "star" is not similar to "tars", "rats", or "arts".

Together, these form two connected groups by similarity: {"tars", "rats", "arts"} and {"star"}. Notice that "tars" and "arts" are in the same group even though they are not similar. Formally, each group is such that a word is in the group if and only if it is similar to at least one other word in the group.

We are given a list strs of strings where every string in strs is an anagram of every other string in strs. How many groups are there?

Example 1:

  • Input: strs = ["tars","rats","arts","star"]
  • Output: 2

Example 2:

  • Input: strs = ["omv","ovm"]
  • Output: 1

Constraints:

  • 1 <= strs.length <= 300
  • 1 <= strs[i].length <= 300
  • strs[i] consists of lowercase letters only.
  • All words in strs have the same length and are anagrams of each other.

Solution⚓︎

class Solution {
private:
    vector<int> p;
    int sets;

    void build(int n) {
        p = vector<int>(n, 0);
        for (int i = 0; i < n; i++) {
            p[i] = i;
        }
        sets = n;
    }

    int find(int i) {
        if (i != p[i]) p[i] = find(p[i]);
        return p[i];
    }

    void unionSets(int x, int y) {
        int fx = find(x), fy = find(y);
        if (fx != fy) {
            p[fx] = fy;
            sets--;
        }
    }

    bool check(const string& a, const string& b) {
        if (a == b) return true;
        int num = 0;
        for (int i = 0; i < a.size(); i++) {
            if (a[i] != b[i]) {
                num++;
                if (num > 2) return false;
            }
        }
        return true;
    }

public:
    int numSimilarGroups(vector<string>& strs) {
        int n = strs.size();
        build(n);

        for (int i = 0; i < n; i++) {
            for (int j = i + 1; j < n; j++) {
                if (check(strs[i], strs[j])) {
                    unionSets(i, j);
                }
            }
        }

        return sets;
    }
};

See reference (Chinese).

  • Time complexity: \(O(n^2 m + n\log n)\);
  • Space complexity: \(O(n)\).