a CLUTO clustering scalability question
I'm looking at different clustering packages and was wondering about CLUTO's clustering scalability
I'm looking to cluster several millions of documents/instances(1-10 million) with several hundreds of thousands of features (100,000-300,000)
could CLUTO handle this load? if not, what load could it handle approximately?
how important is it for me to do some feature reduction in that respect?
thank you so much