Efficient determination of which strings in an array are substrings of the others?

Posted by byte on Stack Overflow See other posts from Stack Overflow or by byte
Published on 2010-06-09T19:41:42Z Indexed on 2010/06/09 19:52 UTC
Read the original article Hit count: 158

Filed under:
|
|

In C#, Say you have an array of strings, which contain only characters '0' and '1':

string[] input = { "0101", "101", "11", "010101011" };

And you'd like to build a function:

public void IdentifySubstrings(string[] input) { ... }

That will produce the following:

"0101 is a substring of 010101011"
"101 is a substring of 0101"
"101 is a substring of 010101011"
"11 is a substring of 010101011"

And you are NOT able to use built-in string functionality (such as String.Substring).

How would one efficiently solve this problem? Of course you could plow through it via brute force, but it just feels like there ought to be a way to accomplish it with a tree (since the only values are 0's and 1's, it feels like a binary tree ought to fit somehow). I've read a little bit about things like suffix trees, but I'm uncertain if that's the right path to be going down.

Any efficient solutions you can think of?

© Stack Overflow or respective owner

Related posts about c#

Related posts about algorithm