373591 (1) [Avatar] Offline
#1
I have uploaded the java file I am using for clustering a bunch of docs. The error message is given below. Any help would be great. I have tried everything I could find but still getting the exception - No input clusters found even though the file part-randomSeed exists in hdfs.

15/09/25 14:51:46 INFO common.HadoopUtil: Deleting /apps/tm/clustering/callcenter/outputBase/sparse/partial-vectors-0
15/09/25 14:51:46 INFO common.AbstractJob: Command line arguments: {--clustering=null, --clusters=[/apps/tm/clustering/callcenter/outputBase/callcenter-kmeans-initial-clusters], --convergenceDelta=[0.5], --distanceMeasure=[org.apache.mahout.common.distance.CosineDistanceMeasure], --endPhase=[2147483647], --input=[/apps/tm/clustering/callcenter/outputBase/sparse/tfidf-vectors], --maxIter=[10], --method=[sequential], --numClusters=[250], --output=[/apps/tm/clustering/callcenter/outputBase/kmeans], --overwrite=null, --startPhase=[0], --tempDir=[temp]}
15/09/25 14:51:46 INFO zlib.ZlibFactory: Successfully loaded & initialized native-zlib library
15/09/25 14:51:46 INFO compress.CodecPool: Got brand-new compressor [.deflate]
15/09/25 14:51:46 INFO kmeans.RandomSeedGenerator: Wrote 250 Klusters to /apps/tm/clustering/callcenter/outputBase/callcenter-kmeans-initial-clusters/part-randomSeed
15/09/25 14:51:46 INFO kmeans.KMeansDriver: Input: /apps/tm/clustering/callcenter/outputBase/sparse/tfidf-vectors Clusters In: /apps/tm/clustering/callcenter/outputBase/callcenter-kmeans-initial-clusters/part-randomSeed Out: /apps/tm/clustering/callcenter/outputBase/kmeans
15/09/25 14:51:46 INFO kmeans.KMeansDriver: convergence: 0.5 max Iterations: 10
15/09/25 14:51:46 INFO compress.CodecPool: Got brand-new decompressor [.deflate]
Exception in thread "main" java.lang.IllegalStateException: No input clusters found in /apps/tm/clustering/callcenter/outputBase/callcenter-kmeans-initial-clusters/part-randomSeed. Check your -c argument.
at org.apache.mahout.clustering.kmeans.KMeansDriver.buildClusters(KMeansDriver.java:213)
at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:147)
at org.apache.mahout.clustering.kmeans.KMeansDriver.run(KMeansDriver.java:110)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at clustering.callcenter.FormatConverterTextToSequenceDriver.cluster(FormatConverterTextToSequenceDriver.java:192)
at clustering.callcenter.FormatConverterTextToSequenceDriver.run(FormatConverterTextToSequenceDriver.java:120)
at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
at clustering.callcenter.FormatConverterTextToSequenceDriver.main(FormatConverterTextToSequenceDriver.java:196)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.apache.hadoop.util.RunJar.run(RunJar.java:221)
at org.apache.hadoop.util.RunJar.main(RunJar.java:136)