[jvm-packages] error using a custom evaluation function (setCustomEval) on spark #3595

Open
wsobra opened this issue Aug 16, 2018 · 19 comments


@wsobra

wsobra commented Aug 16, 2018

I am trying to use a custom evaluation function (setCustomEval) on Spark. However, it fails with the following error:

[ERROR] [08/16/2018 13:34:42.313] [RabitTracker-akka.actor.default-dispatcher-11] [LocalActorRefProvider(akka://RabitTracker)] guardian failed, shutting down system
java.lang.AssertionError: assertion failed
at scala.Predef$.assert(Predef.scala:156)
at ml.dmlc.xgboost4j.scala.rabit.handler.WorkerDependencyResolver$$anonfun$receive$2.applyOrElse(RabitTrackerHandler.scala:298)
at akka.actor.Actor$class.aroundReceive(Actor.scala:467)
at ml.dmlc.xgboost4j.scala.rabit.handler.WorkerDependencyResolver.aroundReceive(RabitTrackerHandler.scala:264)
at akka.actor.ActorCell.receiveMessage(ActorCell.scala:516)
at akka.actor.ActorCell.invoke(ActorCell.scala:487)
at akka.dispatch.Mailbox.processMailbox(Mailbox.scala:238)
at akka.dispatch.Mailbox.run(Mailbox.scala:220)
at akka.dispatch.ForkJoinExecutorConfigurator$AkkaForkJoinTask.exec(AbstractDispatcher.scala:397)
at scala.concurrent.forkjoin.ForkJoinTask.doExec(ForkJoinTask.java:260)
at scala.concurrent.forkjoin.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1339)
at scala.concurrent.forkjoin.ForkJoinPool.runWorker(ForkJoinPool.java:1979)
at scala.concurrent.forkjoin.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:107)

@wsobra wsobra changed the title error using custom evaluation function (setCustomEval) on spark error using a custom evaluation function (setCustomEval) on spark Aug 16, 2018
@wsobra wsobra changed the title error using a custom evaluation function (setCustomEval) on spark [jvm-packages] error using a custom evaluation function (setCustomEval) on spark Aug 16, 2018
@hcho3
Collaborator

hcho3 commented Aug 16, 2018

What's your cluster setup? And can you post your custom evaluation function?

@wsobra
Author

wsobra commented Aug 16, 2018

I have seven r4.2xlarge instances on AWS EMR, and my spark-submit command is:

spark-submit --driver-memory 15G --executor-memory 10G --conf spark.executor.cores=4 --conf spark.task.cpus=2 --conf spark.executor.extraJavaOptions="-XX:ThreadStackSize=41920" --class Main Main.jar

I have tested this example custom evaluation function and it is not working: https://github.com/dmlc/xgboost/blob/master/jvm-packages/xgboost4j-example/src/main/scala/ml/dmlc/xgboost4j/scala/example/util/CustomEval.scala

It works on my local machine.

@CodingCat
Member

Are you using the akka version of the Rabit tracker? Can you try the python one?

@wsobra
Author

wsobra commented Aug 16, 2018

With python, the execution freezes. My code:

var paramsXGB: Map[String, Any] = Map[String, Any]("tracker_conf" -> TrackerConf(600*3000L, "python"))

var xgb = new XGBoostClassifier(paramsXGB)
  .setFeaturesCol("features")
  .setNumRound(200)
  .setNumWorkers(20)
  .setObjective("binary:logistic")
  .setSilent(0)
  .setMaxDepth(4.0)
  .setMinChildWeight(5.0)
  .setGamma(0.0)
  .setEta(0.2)
  .setSubsample(0.9)
  .setColsampleBytree(inparams.colSampleByTree)
  .setAlpha(0.0)
  .setScalePosWeight(120)
  .setNthread(2)
  .setSeed(12345)
  .setMissing(0)
  .setNumEarlyStoppingRounds(10)
  .setTrainTestRatio(0.9)

The problem happens when I use NumEarlyStoppingRounds as the stopping criterion.

@CodingCat
Member

How many executors did you try to claim?

You are requiring 20 * 2 cores, but I don't see a num-executors parameter in your spark-submit.
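
The arithmetic behind that estimate can be sketched as follows. This is only an illustration: the object and function names are hypothetical, the numbers are taken from the spark-submit command and classifier settings earlier in this thread (20 workers, spark.executor.cores=4, spark.task.cpus=2), and it assumes that each XGBoost worker occupies exactly one Spark task slot.

```scala
object ResourceCheck {
  // Cores needed if every XGBoost worker occupies one Spark task slot.
  def coresRequired(numWorkers: Int, taskCpus: Int): Int = numWorkers * taskCpus

  // Number of tasks that can run at the same time on one executor.
  def tasksPerExecutor(executorCores: Int, taskCpus: Int): Int = executorCores / taskCpus

  // Minimum number of executors so that all workers can start simultaneously.
  def executorsRequired(numWorkers: Int, executorCores: Int, taskCpus: Int): Int = {
    val slots = tasksPerExecutor(executorCores, taskCpus)
    require(slots > 0, "spark.task.cpus must not exceed spark.executor.cores")
    (numWorkers + slots - 1) / slots // ceiling division
  }

  def main(args: Array[String]): Unit = {
    println(coresRequired(20, 2))        // 40
    println(executorsRequired(20, 4, 2)) // 10
  }
}
```

If the cluster cannot grant that many slots at once, some workers never start and the others wait for them, which would look like a freeze.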

@wsobra
Author

wsobra commented Aug 16, 2018

@hcho3, @CodingCat, thanks for the help.

I have now set num-executors to 20, and the execution still freezes.

@CodingCat
Member

Can you read this part: https://xgboost.readthedocs.io/en/latest/jvm/xgboost4j_spark_tutorial.html#gang-scheduling

and set a timeout threshold for resource claiming? I think you somehow didn't get enough resources for training.

With this set, your application should fail if it cannot get enough cores within your threshold time (ms).
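
A sketch of how such a timeout might be set, assuming the `timeout_request_workers` parameter described in the gang-scheduling section of the linked tutorial at the time of this thread. Check the documentation for your xgboost4j-spark version, since the parameter name and its availability may differ. The object name and the 60-second value are illustrative only.

```scala
object TimeoutParams {
  // Parameter map to pass to XGBoostClassifier, like paramsXGB earlier in this thread.
  // Assumption: "timeout_request_workers" is the time, in milliseconds, to wait for
  // enough task slots before the job fails instead of hanging.
  val params: Map[String, Any] = Map(
    "num_workers" -> 20,
    "timeout_request_workers" -> 60000L
  )
}
```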

@wsobra
Author

wsobra commented Aug 17, 2018

Thank you, it's working now. However, the code that motivated me to test the custom evaluation function example has the same freeze problem. Below is the code that calculates the maximum KS (http://www.physics.csbsju.edu/stats/KS-test.html). Any suggestions?

import ml.dmlc.xgboost4j.java.XGBoostError
import ml.dmlc.xgboost4j.scala.{DMatrix, EvalTrait}
import org.apache.commons.logging.LogFactory

class KSEvaluation extends EvalTrait {
  val logger = LogFactory.getLog(classOf[KSEvaluation])
  private[neurotech] var evalMetric: String = "ks"
  override def getMetric: String = evalMetric

  // Returns the maximum KS distance between the binned score distributions of
  // label 0 (bons) and label 1 (maus) for the rows held in dmat.
  override def eval(predicts: Array[Array[Float]], dmat: DMatrix): Float = {
    var labels: Array[Float] = null
    try {
      labels = dmat.getLabel
    } catch {
      case ex: XGBoostError =>
        logger.error(ex)
        return -1f
    }

    require(predicts.length == labels.length, s"predicts length ${predicts.length} has to be" +
      s" equal with label length ${labels.length}")

    val bins = 256
    val nrow: Int = predicts.length

    // First prediction column, discretized into bin indices 0 .. bins - 1.
    val p0 = getCol(1, predicts)
    val norm = discretize(p0, bins)

    // Per-bin counts of label 1 (maus) and label 0 (bons).
    val maus = Array.fill[Int](bins)(0)
    val bons = Array.fill[Int](bins)(0)

    for (i <- 0 until nrow) {
      val l = labels(i).toInt
      bons(norm(i)) += 1 - l
      maus(norm(i)) += l
    }

    // Per-bin relative frequencies; the totals are computed once.
    val bonsTotal = bons.sum * 1.0f
    val mausTotal = maus.sum * 1.0f
    val bonsf = bons.map(_ / bonsTotal)
    val mausf = maus.map(_ / mausTotal)

    // Cumulative frequencies per class and the largest gap between them.
    val freq_ac_bom = Array.fill[Float](bins)(0f)
    val freq_ac_mau = Array.fill[Float](bins)(0f)

    var maxKS: Float = 0.0f

    for (i <- 0 until bins) {
      if (i == 0) {
        freq_ac_bom(i) = bonsf(i)
        freq_ac_mau(i) = mausf(i)
      } else {
        freq_ac_bom(i) = freq_ac_bom(i - 1) + bonsf(i)
        freq_ac_mau(i) = freq_ac_mau(i - 1) + mausf(i)
      }

      val d = math.abs(freq_ac_bom(i) - freq_ac_mau(i))
      if (d > maxKS) {
        maxKS = d
      }
    }

    maxKS
  }

  // Scales values linearly to [0, bins - 1] and truncates to integer bin indices.
  def discretize(values: Array[Float], bins: Int = 256): Array[Int] = {
    val max = values.max
    val min = values.min
    val binsminusone = bins - 1
    val minmax = max - min
    values.map(v => ((v - min) / minmax * binsminusone).toInt)
  }

  // Column n (1-based) of a row-major matrix.
  def getCol(n: Int, a: Array[Array[Float]]): Array[Float] = a.map(_(n - 1))
}
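
To check the metric logic locally without a cluster, the same binning idea can be sketched without any XGBoost classes. This is an illustration under assumptions, not the code used in the thread: the object and function names are hypothetical, it takes a flat array of scores and integer labels instead of `Array[Array[Float]]` and a `DMatrix`, and it puts every score in bin 0 when all scores are equal.

```scala
object KsSketch {
  // Maximum gap between the cumulative score distributions of label 0 and label 1.
  def maxKs(scores: Array[Float], labels: Array[Int], bins: Int = 256): Float = {
    require(scores.length == labels.length, "scores and labels must have equal length")
    val min = scores.min
    val range = scores.max - min
    // Bin index in [0, bins - 1]; every score falls in bin 0 when all scores are equal.
    val bin = scores.map(s => if (range == 0f) 0 else ((s - min) / range * (bins - 1)).toInt)

    val neg = new Array[Int](bins) // label 0 counts per bin
    val pos = new Array[Int](bins) // label 1 counts per bin
    for (i <- scores.indices) {
      if (labels(i) == 1) pos(bin(i)) += 1 else neg(bin(i)) += 1
    }

    // Cumulative fraction of one class up to and including each bin.
    def cdf(counts: Array[Int]): Array[Float] = {
      val total = counts.sum.toFloat
      counts.scanLeft(0f)((acc, c) => acc + c / total).tail
    }

    cdf(neg).zip(cdf(pos)).map { case (n, p) => math.abs(n - p) }.max
  }
}
```

For example, `KsSketch.maxKs(Array(0.1f, 0.2f, 0.8f, 0.9f), Array(0, 0, 1, 1), 4)` describes perfectly separated classes. Note that if one class is absent from the data passed in, its total is zero and the fractions are not meaningful, which can happen on a small partition with rare positives.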

@wsobra
Author

wsobra commented Aug 21, 2018

When I use the KSEvaluation function, this happens (I set a timeout threshold for resource claiming and I am using the python tracker):

18/08/21 18:21:47 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:47,239 DEBUG Recieve start signal from 172.31.47.176; assign rank 18
18/08/21 18:21:47 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:47,327 DEBUG Recieve start signal from 172.31.47.176; assign rank 19
18/08/21 18:21:47 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:47,328 INFO @tracker All of 20 nodes getting started
18/08/21 18:21:50 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:50,174 INFO [0]	train-ks:0.209919	[0]	test-ks:0.218397
18/08/21 18:21:50 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:50,696 INFO [1]	train-ks:0.231111	[1]	test-ks:0.244571
18/08/21 18:21:51 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:51,224 INFO [2]	train-ks:0.246865	[2]	test-ks:0.258942
18/08/21 18:21:51 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:51,844 INFO [3]	train-ks:0.253572	[3]	test-ks:0.261869
18/08/21 18:21:52 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:52,432 INFO [4]	train-ks:0.259405	[4]	test-ks:0.267962
18/08/21 18:21:53 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:53,058 INFO [5]	train-ks:0.262024	[5]	test-ks:0.270507
18/08/21 18:21:53 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:53,567 INFO [6]	train-ks:0.265157	[6]	test-ks:0.272917
18/08/21 18:21:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:54,164 INFO [7]	train-ks:0.271893	[7]	test-ks:0.279263
18/08/21 18:21:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:54,667 INFO [8]	train-ks:0.275109	[8]	test-ks:0.280031
18/08/21 18:21:55 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:55,198 INFO [9]	train-ks:0.277155	[9]	test-ks:0.285437
18/08/21 18:21:55 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:55,733 INFO early stopping
18/08/21 18:21:55 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:55,734 INFO [10]	train-ks:0.277285	[10]	test-ks:0.285755
18/08/21 18:21:55 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:55,736 DEBUG Recieve shutdown signal from 9
18/08/21 18:21:55 INFO BlockManagerInfo: Added rdd_285_6 in memory on ip-172-31-42-49.ec2.internal:41901 (size: 4.2 KB, free: 4.5 GB)
18/08/21 18:21:55 INFO TaskSetManager: Finished task 6.0 in stage 51.0 (TID 1982) in 10596 ms on ip-172-31-42-49.ec2.internal (executor 11) (1/20)
18/08/21 18:21:55 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:55,868 DEBUG Recieve recover signal from 12
18/08/21 18:21:55 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:55,956 DEBUG Recieve recover signal from 11
18/08/21 18:21:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:56,044 DEBUG Recieve recover signal from 10
18/08/21 18:21:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:56,131 DEBUG Recieve recover signal from 1
18/08/21 18:21:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:56,218 DEBUG Recieve recover signal from 0
18/08/21 18:21:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:56,307 DEBUG Recieve recover signal from 2
18/08/21 18:21:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:56,395 DEBUG Recieve recover signal from 3
18/08/21 18:21:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:56,483 DEBUG Recieve recover signal from 19
18/08/21 18:21:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:56,570 DEBUG Recieve recover signal from 8
18/08/21 18:21:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:56,656 DEBUG Recieve recover signal from 5
18/08/21 18:21:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:56,743 DEBUG Recieve recover signal from 18
18/08/21 18:21:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:56,831 DEBUG Recieve recover signal from 17
18/08/21 18:21:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:56,918 DEBUG Recieve recover signal from 7
18/08/21 18:21:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:57,004 DEBUG Recieve recover signal from 6
18/08/21 18:21:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:57,092 DEBUG Recieve recover signal from 4
18/08/21 18:21:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:57,178 DEBUG Recieve recover signal from 13
18/08/21 18:21:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:57,266 DEBUG Recieve recover signal from 14
18/08/21 18:21:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:57,354 DEBUG Recieve recover signal from 16
18/08/21 18:21:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:57,442 DEBUG Recieve recover signal from 15
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1422
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1395
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1374
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1388
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1378
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1406
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1471
18/08/21 18:22:25 INFO ContextCleaner: Cleaned shuffle 16
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1421
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1415
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1435
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1452
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1385
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_76_piece0 on ip-172-31-45-119.ec2.internal:43921 in memory (size: 4.2 KB, free: 8.8 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_76_piece0 on ip-172-31-41-206.ec2.internal:46813 in memory (size: 4.2 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1462
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1409
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1436
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1400
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1381
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1453
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1438
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1445
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1467
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1461
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1372
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1367
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1430
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1426
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1432
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1425
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1451
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1402
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1441
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1393
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1428
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1412
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1384
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1373
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1475
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1460
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1377
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-45-119.ec2.internal:43921 in memory (size: 44.7 KB, free: 8.8 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-45-37.ec2.internal:45009 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-41-206.ec2.internal:46813 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-45-37.ec2.internal:34419 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-34-158.ec2.internal:34059 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-34-158.ec2.internal:41739 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-47-176.ec2.internal:43819 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-42-49.ec2.internal:40453 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-45-37.ec2.internal:32973 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-41-206.ec2.internal:43671 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-41-206.ec2.internal:39451 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-47-176.ec2.internal:35951 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-45-119.ec2.internal:33785 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-45-37.ec2.internal:36015 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-42-49.ec2.internal:36673 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-34-158.ec2.internal:44173 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-45-119.ec2.internal:32997 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-47-176.ec2.internal:37787 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-45-119.ec2.internal:33065 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-34-158.ec2.internal:42939 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_74_piece0 on ip-172-31-42-49.ec2.internal:41901 in memory (size: 44.7 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1404
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1477
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1382
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1413
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1423
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1444
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1394
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1443
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1418
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1447
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1390
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1427
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1387
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1456
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1454
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1391
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1480
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1376
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1469
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1479
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1405
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1429
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1433
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1448
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1458
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1397
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1474
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1386
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1455
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1417
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1370
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1470
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1473
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1450
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1408
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1414
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1368
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1442
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1403
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1383
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1407
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-45-119.ec2.internal:43921 in memory (size: 24.9 KB, free: 8.8 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-34-158.ec2.internal:34059 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-45-37.ec2.internal:34419 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-41-206.ec2.internal:39451 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-42-49.ec2.internal:36673 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-41-206.ec2.internal:43671 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-34-158.ec2.internal:44173 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-45-119.ec2.internal:33785 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-34-158.ec2.internal:42939 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-42-49.ec2.internal:41901 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-47-176.ec2.internal:43819 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-34-158.ec2.internal:41739 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-45-37.ec2.internal:32973 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_73_piece0 on ip-172-31-45-119.ec2.internal:32997 in memory (size: 24.9 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1446
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1416
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1401
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1410
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1478
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1466
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1369
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1437
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1371
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1463
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1459
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1392
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1457
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1476
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1379
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-45-119.ec2.internal:43921 in memory (size: 7.8 KB, free: 8.8 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-45-37.ec2.internal:34419 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-45-37.ec2.internal:45009 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-47-176.ec2.internal:37787 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-45-119.ec2.internal:33065 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-34-158.ec2.internal:41739 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-41-206.ec2.internal:46813 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-47-176.ec2.internal:43819 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-42-49.ec2.internal:40453 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-45-37.ec2.internal:32973 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-34-158.ec2.internal:42939 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-42-49.ec2.internal:41901 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-41-206.ec2.internal:39451 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-45-119.ec2.internal:32997 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-42-49.ec2.internal:36673 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-41-206.ec2.internal:43671 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-47-176.ec2.internal:35951 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-45-119.ec2.internal:33785 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-34-158.ec2.internal:34059 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-45-37.ec2.internal:36015 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO BlockManagerInfo: Removed broadcast_75_piece0 on ip-172-31-34-158.ec2.internal:44173 in memory (size: 7.8 KB, free: 4.5 GB)
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1389
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1419
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1431
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1449
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1375
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1434
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1380
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1424
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1468
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1440
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1399
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1439
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1366
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1464
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1472
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1420
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1411
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1396
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1398
18/08/21 18:22:25 INFO ContextCleaner: Cleaned accumulator 1465

@CodingCat
Member

18/08/21 18:21:55 INFO RabitTracker$TrackerProcessLogger: 2018-08-21 18:21:55,868 DEBUG Recieve recover signal from 12

This is suspicious. Can you check whether any of your executors died? Or did you turn on something like dynamic allocation?

@wsobra
Author

wsobra commented Aug 22, 2018

@CodingCat, I set spark.dynamicAllocation.enabled to false. Below are a log and an image of the active tasks. I have noticed that two executors (6 and 9) satisfy the stopping criterion and complete their tasks. Two other executors (4 and 15) are not active, but they did not stop because of the stopping criterion. Could that be the problem? Another difference is that executors 6 and 9 appear as RDD blocks.

18/08/22 18:04:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:14,379 DEBUG Recieve start signal from 172.31.47.20; assign rank 18
18/08/22 18:04:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:14,467 DEBUG Recieve start signal from 172.31.47.20; assign rank 19
18/08/22 18:04:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:14,468 INFO @tracker All of 20 nodes getting started
18/08/22 18:04:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:17,596 INFO [0]	train-ks:0.790145	[0]	test-ks:0.792545
18/08/22 18:04:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:18,776 INFO [1]	train-ks:0.757689	[1]	test-ks:0.769063
18/08/22 18:04:19 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:19,935 INFO early stopping after 10 decreasing rounds
18/08/22 18:04:19 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:19,936 INFO early stopping after 10 decreasing rounds
18/08/22 18:04:19 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:19,939 INFO [2]	train-ks:0.749513	[2]	test-ks:0.760905
18/08/22 18:04:19 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:19,952 DEBUG Recieve shutdown signal from 18
18/08/22 18:04:19 INFO BlockManagerInfo: Added rdd_175_14 in memory on ip-172-31-47-20.ec2.internal:33735 (size: 4.2 KB, free: 4.6 GB)
18/08/22 18:04:19 INFO TaskSetManager: Finished task 14.0 in stage 50.0 (TID 2187) in 7559 ms on ip-172-31-47-20.ec2.internal (executor 6) (1/20)
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,084 DEBUG Recieve recover signal from 16
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,171 DEBUG Recieve recover signal from 17
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,258 DEBUG Recieve recover signal from 19
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,344 DEBUG Recieve recover signal from 15
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,432 DEBUG Recieve recover signal from 14
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,518 DEBUG Recieve recover signal from 13
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,603 DEBUG Recieve recover signal from 0
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,691 DEBUG Recieve recover signal from 1
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,692 DEBUG Recieve shutdown signal from 4
18/08/22 18:04:20 INFO BlockManagerInfo: Added rdd_175_8 in memory on ip-172-31-35-42.ec2.internal:40631 (size: 4.2 KB, free: 4.6 GB)
18/08/22 18:04:20 INFO TaskSetManager: Finished task 8.0 in stage 50.0 (TID 2181) in 8300 ms on ip-172-31-35-42.ec2.internal (executor 9) (2/20)
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,779 DEBUG Recieve recover signal from 2
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,867 DEBUG Recieve recover signal from 3
18/08/22 18:04:20 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:20,954 DEBUG Recieve recover signal from 8
18/08/22 18:04:21 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:21,041 DEBUG Recieve recover signal from 7
18/08/22 18:04:21 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:21,127 DEBUG Recieve recover signal from 6
18/08/22 18:04:21 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:21,215 DEBUG Recieve recover signal from 5
18/08/22 18:04:21 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:21,302 DEBUG Recieve recover signal from 12
18/08/22 18:04:21 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:21,390 DEBUG Recieve recover signal from 9
18/08/22 18:04:21 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:21,478 DEBUG Recieve recover signal from 11
18/08/22 18:04:21 INFO RabitTracker$TrackerProcessLogger: 2018-08-22 18:04:21,566 DEBUG Recieve recover signal from 10

print

@CodingCat
Member

Can you share more logs from the active executors? Somehow your XGBoost workers sent recover signals to the tracker, but on the driver side I didn't see any log indicating a task failure.

And if you turn off early stopping, is the problem still there?

@wsobra
Author

wsobra commented Aug 24, 2018

Yes, if I turn off early stopping, it works. I think the problem happens when one worker stops because its stopping criterion is met while the other workers have not stopped and want to continue. With a default metric this doesn't happen: all workers stop at the same time.
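A minimal stand-alone sketch of the divergence suspected here. It does not use XGBoost4J: `firstStopRound`, the `patience` parameter, and the metric values are all hypothetical, and the rule itself (higher is better, stop once the metric has gone `patience` rounds without a new best) is an assumption about how the early-stopping check behaves, not a copy of the library's code.

```java
final class EarlyStopDivergence {
    /**
     * Returns the first round at which a worker would decide to stop,
     * or -1 if it never decides to stop within the given history.
     * Assumed rule: stop when no new best value was seen for `patience` rounds.
     */
    static int firstStopRound(double[] metric, int patience) {
        double best = Double.NEGATIVE_INFINITY;
        int bestRound = -1;
        for (int round = 0; round < metric.length; round++) {
            if (metric[round] > best) {
                best = metric[round];
                bestRound = round;
            } else if (round - bestRound >= patience) {
                return round;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        // Hypothetical per-worker values of the same custom metric.
        double[] workerA = {0.20, 0.23, 0.24, 0.25, 0.26, 0.27};
        double[] workerB = {0.22, 0.26, 0.25, 0.24, 0.25, 0.25};
        System.out.println(firstStopRound(workerA, 2)); // -1: keeps training
        System.out.println(firstStopRound(workerB, 2)); // 3: wants to stop
    }
}
```

If each worker applies the rule to its own metric values, one worker can leave the training loop while the rest still expect it to take part in the next round, which would be consistent with the hang reported above.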

@wsobra
Author

wsobra commented Aug 25, 2018

@CodingCat, to better understand what is happening, I modified the following lines in XGBoost.java so that every worker, not only rank 0, prints its evaluation result:

//if (Rabit.getRank() == 0) {
//	Rabit.trackerPrint(evalInfo + '\n');
//}
Rabit.trackerPrint("| " + Rabit.getRank() +"| ::" + evalInfo + '\n');

When I use AUC or any other built-in XGBoost metric, the behavior is what the first log (log1) shows: the evaluation covers the whole validation set, and every worker reports the same value. When I use a custom metric, each worker reports a different value, because it evaluates only its own minibatch of the validation set. Therefore, if the stopping criterion is satisfied on one worker, only that worker stops and the others try to continue. How can I make each worker evaluate the whole validation set? Is there a way to make all workers stop when one stops, or to let the workers continue until each one's stopping criterion is satisfied?
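One possible direction, sketched under stated assumptions: instead of scoring its own partition, each worker could compute additive statistics, sum them across workers, and derive the metric from the totals, so that every worker sees the same number. The sketch below is plain Java and does not touch XGBoost4J. `GlobalKsSketch`, `sumAcrossWorkers`, and the histogram values are hypothetical, the KS statistic is approximated from binned score counts rather than computed exactly, and the summation only stands in for a sum allreduce (for example via `Rabit.allReduce`). Whether such a collective can safely be called from inside a custom evaluation function in this version is not confirmed here.

```java
final class GlobalKsSketch {
    /** Element-wise sum of per-worker arrays; stands in for a sum allreduce. */
    static long[] sumAcrossWorkers(long[][] perWorker) {
        long[] total = new long[perWorker[0].length];
        for (long[] local : perWorker) {
            for (int i = 0; i < total.length; i++) {
                total[i] += local[i];
            }
        }
        return total;
    }

    /** Binned KS: largest gap between the cumulative positive and negative distributions. */
    static double ks(long[] posCounts, long[] negCounts) {
        long posTotal = 0;
        long negTotal = 0;
        for (int i = 0; i < posCounts.length; i++) {
            posTotal += posCounts[i];
            negTotal += negCounts[i];
        }
        if (posTotal == 0 || negTotal == 0) {
            return 0.0; // undefined without both classes; 0.0 is a placeholder choice
        }
        long posCum = 0;
        long negCum = 0;
        double best = 0.0;
        for (int i = 0; i < posCounts.length; i++) {
            posCum += posCounts[i];
            negCum += negCounts[i];
            double gap = Math.abs((double) posCum / posTotal - (double) negCum / negTotal);
            best = Math.max(best, gap);
        }
        return best;
    }

    public static void main(String[] args) {
        // Hypothetical score histograms (4 bins) for two workers.
        long[][] pos = {{1, 1, 3, 5}, {2, 2, 2, 4}};
        long[][] neg = {{5, 3, 1, 1}, {3, 4, 2, 1}};

        // Local values differ from worker to worker.
        for (int w = 0; w < pos.length; w++) {
            System.out.printf(java.util.Locale.ROOT, "worker %d local ks: %.2f%n", w, ks(pos[w], neg[w]));
        }
        // The value derived from summed counts is identical on every worker.
        double global = ks(sumAcrossWorkers(pos), sumAcrossWorkers(neg));
        System.out.printf(java.util.Locale.ROOT, "global ks: %.2f%n", global);
    }
}
```

Because every worker would base its early-stopping decision on the same global value, all workers would reach the stopping round together, as they do with the built-in metrics in the first log.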

log1

18/08/25 10:40:09 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:09,400 INFO @tracker All of 20 nodes getting started
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,517 INFO | 7| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,518 INFO | 0| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,518 INFO | 11| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,518 INFO | 1| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,518 INFO | 9| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,519 INFO | 2| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,519 INFO | 10| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,519 INFO | 3| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,520 INFO | 5| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,520 INFO | 16| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,520 INFO | 17| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,520 INFO | 15| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,521 INFO | 13| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,521 INFO | 8| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,521 INFO | 14| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,521 INFO | 12| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,522 INFO | 4| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,522 INFO | 19| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,522 INFO | 18| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:12 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:12,522 INFO | 6| ::[0]	train-auc:0.647258	test-auc:0.646319
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,702 INFO | 0| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,702 INFO | 1| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,702 INFO | 17| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,703 INFO | 19| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,703 INFO | 13| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,703 INFO | 18| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,704 INFO | 14| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,704 INFO | 16| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,704 INFO | 15| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,704 INFO | 10| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,705 INFO | 11| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,705 INFO | 12| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,705 INFO | 5| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,706 INFO | 4| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,706 INFO | 3| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,706 INFO | 2| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,706 INFO | 6| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,706 INFO | 9| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,707 INFO | 8| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:13 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:13,707 INFO | 7| ::[1]	train-auc:0.666105	test-auc:0.664926
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,871 INFO | 0| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,871 INFO | 8| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,871 INFO | 1| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,871 INFO | 2| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,872 INFO | 18| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,872 INFO | 17| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,872 INFO | 13| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,873 INFO | 14| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,873 INFO | 19| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,873 INFO | 15| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,873 INFO | 10| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,874 INFO | 3| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,874 INFO | 4| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,874 INFO | 16| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,874 INFO | 12| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,875 INFO | 5| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,875 INFO | 6| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,875 INFO | 11| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,875 INFO | 7| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:14 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:14,876 INFO | 9| ::[2]	train-auc:0.677590	test-auc:0.675422
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,058 INFO | 8| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,058 INFO | 0| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,058 INFO | 7| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,058 INFO | 1| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,059 INFO | 2| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,059 INFO | 3| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,059 INFO | 14| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,060 INFO | 13| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,060 INFO | 10| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,060 INFO | 15| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,060 INFO | 12| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,061 INFO | 19| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,061 INFO | 16| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,061 INFO | 4| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,062 INFO | 18| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,062 INFO | 6| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,062 INFO | 17| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,062 INFO | 5| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,063 INFO | 11| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:16 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:16,063 INFO | 9| ::[3]	train-auc:0.681147	test-auc:0.678610
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,215 INFO | 7| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,216 INFO | 0| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,216 INFO | 2| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,216 INFO | 19| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,216 INFO | 1| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,217 INFO | 12| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,217 INFO | 18| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,217 INFO | 10| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,218 INFO | 17| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,218 INFO | 6| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,218 INFO | 3| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,218 INFO | 15| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,219 INFO | 16| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,219 INFO | 5| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,219 INFO | 13| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,219 INFO | 8| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,220 INFO | 4| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,220 INFO | 14| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,220 INFO | 11| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:17 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:17,220 INFO | 9| ::[4]	train-auc:0.683902	test-auc:0.681537
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,359 INFO | 7| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,359 INFO | 1| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,359 INFO | 2| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,360 INFO | 0| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,360 INFO | 18| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,360 INFO | 11| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,360 INFO | 12| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,361 INFO | 19| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,361 INFO | 17| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,361 INFO | 9| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,361 INFO | 6| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,362 INFO | 10| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,362 INFO | 5| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,362 INFO | 15| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,362 INFO | 13| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,363 INFO | 14| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,363 INFO | 3| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,363 INFO | 4| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,364 INFO | 8| ::[5]	train-auc:0.685133	test-auc:0.682716
18/08/25 10:40:18 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:40:18,364 INFO | 16| ::[5]	train-auc:0.685133	test-auc:0.682716

log2

18/08/25 10:46:51 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:51,636 INFO @tracker All of 20 nodes getting started
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,865 INFO | 4| ::	[0]	train-ks:0.213605	[0]	test-ks:0.229656
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,865 INFO | 14| ::	[0]	train-ks:0.216186	[0]	test-ks:0.214618
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,866 INFO | 9| ::	[0]	train-ks:0.218315	[0]	test-ks:0.209641
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,866 INFO | 7| ::	[0]	train-ks:0.223061	[0]	test-ks:0.226404
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,866 INFO | 6| ::	[0]	train-ks:0.219679	[0]	test-ks:0.212740
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,867 INFO | 8| ::	[0]	train-ks:0.217867	[0]	test-ks:0.207583
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,867 INFO | 2| ::	[0]	train-ks:0.222575	[0]	test-ks:0.207927
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,867 INFO | 10| ::	[0]	train-ks:0.220124	[0]	test-ks:0.214620
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,868 INFO | 1| ::	[0]	train-ks:0.217385	[0]	test-ks:0.225204
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,868 INFO | 0| ::	[0]	train-ks:0.220791	[0]	test-ks:0.208775
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,868 INFO | 11| ::	[0]	train-ks:0.216304	[0]	test-ks:0.194275
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,869 INFO | 16| ::	[0]	train-ks:0.222001	[0]	test-ks:0.234345
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,872 INFO | 17| ::	[0]	train-ks:0.211886	[0]	test-ks:0.222184
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,873 INFO | 5| ::	[0]	train-ks:0.212823	[0]	test-ks:0.258343
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,873 INFO | 13| ::	[0]	train-ks:0.214773	[0]	test-ks:0.232836
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,873 INFO | 3| ::	[0]	train-ks:0.212174	[0]	test-ks:0.230747
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,877 INFO | 12| ::	[0]	train-ks:0.206688	[0]	test-ks:0.222517
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,877 INFO | 15| ::	[0]	train-ks:0.218909	[0]	test-ks:0.225045
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,891 INFO | 18| ::	[0]	train-ks:0.212615	[0]	test-ks:0.232313
18/08/25 10:46:54 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:54,892 INFO | 19| ::	[0]	train-ks:0.214163	[0]	test-ks:0.224703
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,047 INFO | 2| ::	[1]	train-ks:0.246214	[1]	test-ks:0.230180
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,048 INFO | 1| ::	[1]	train-ks:0.242899	[1]	test-ks:0.253901
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,048 INFO | 6| ::	[1]	train-ks:0.239979	[1]	test-ks:0.235272
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,048 INFO | 4| ::	[1]	train-ks:0.242934	[1]	test-ks:0.261993
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,049 INFO | 11| ::	[1]	train-ks:0.237257	[1]	test-ks:0.215567
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,049 INFO | 5| ::	[1]	train-ks:0.234536	[1]	test-ks:0.284756
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,049 INFO | 10| ::	[1]	train-ks:0.245604	[1]	test-ks:0.242959
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,049 INFO | 8| ::	[1]	train-ks:0.240115	[1]	test-ks:0.237748
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,050 INFO | 7| ::	[1]	train-ks:0.246637	[1]	test-ks:0.233669
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,050 INFO | 3| ::	[1]	train-ks:0.239128	[1]	test-ks:0.262906
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,050 INFO | 14| ::	[1]	train-ks:0.239387	[1]	test-ks:0.230559
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,051 INFO | 9| ::	[1]	train-ks:0.247214	[1]	test-ks:0.240998
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,051 INFO | 0| ::	[1]	train-ks:0.246637	[1]	test-ks:0.229674
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,053 INFO | 17| ::	[1]	train-ks:0.233025	[1]	test-ks:0.250485
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,054 INFO | 15| ::	[1]	train-ks:0.242287	[1]	test-ks:0.244587
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,054 INFO | 12| ::	[1]	train-ks:0.228657	[1]	test-ks:0.260486
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,054 INFO | 18| ::	[1]	train-ks:0.239501	[1]	test-ks:0.245938
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,054 INFO | 13| ::	[1]	train-ks:0.240542	[1]	test-ks:0.252458
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,055 INFO | 16| ::	[1]	train-ks:0.248654	[1]	test-ks:0.252166
18/08/25 10:46:56 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:56,055 INFO | 19| ::	[1]	train-ks:0.238334	[1]	test-ks:0.245183
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,286 INFO | 17| ::	[2]	train-ks:0.245642	[2]	test-ks:0.264550
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,286 INFO | 13| ::	[2]	train-ks:0.258708	[2]	test-ks:0.269986
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,291 INFO | 4| ::	[2]	train-ks:0.254078	[2]	test-ks:0.265361
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,292 INFO | 0| ::	[2]	train-ks:0.258075	[2]	test-ks:0.242503
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,292 INFO | 3| ::	[2]	train-ks:0.252931	[2]	test-ks:0.277819
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,294 INFO | 1| ::	[2]	train-ks:0.255088	[2]	test-ks:0.257959
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,295 INFO | 6| ::	[2]	train-ks:0.257191	[2]	test-ks:0.249357
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,296 INFO | 11| ::	[2]	train-ks:0.255612	[2]	test-ks:0.232128
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,296 INFO | 5| ::	[2]	train-ks:0.246587	[2]	test-ks:0.300533
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,296 INFO | 7| ::	[2]	train-ks:0.257092	[2]	test-ks:0.249160
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,297 INFO | 9| ::	[2]	train-ks:0.262425	[2]	test-ks:0.259207
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,297 INFO | 2| ::	[2]	train-ks:0.257861	[2]	test-ks:0.239291
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,298 INFO | 8| ::	[2]	train-ks:0.254339	[2]	test-ks:0.241654
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,298 INFO | 10| ::	[2]	train-ks:0.261129	[2]	test-ks:0.253036
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,303 INFO | 18| ::	[2]	train-ks:0.259899	[2]	test-ks:0.252174
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,304 INFO | 12| ::	[2]	train-ks:0.237627	[2]	test-ks:0.270608
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,304 INFO | 16| ::	[2]	train-ks:0.260356	[2]	test-ks:0.254510
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,304 INFO | 14| ::	[2]	train-ks:0.255899	[2]	test-ks:0.241820
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,305 INFO | 19| ::	[2]	train-ks:0.256454	[2]	test-ks:0.247556
18/08/25 10:46:57 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:57,306 INFO | 15| ::	[2]	train-ks:0.258677	[2]	test-ks:0.257312
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,462 INFO | 11| ::	[3]	train-ks:0.259033	[3]	test-ks:0.240954
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,462 INFO | 10| ::	[3]	train-ks:0.269501	[3]	test-ks:0.263501
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,463 INFO | 1| ::	[3]	train-ks:0.259490	[3]	test-ks:0.266757
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,463 INFO | 3| ::	[3]	train-ks:0.262790	[3]	test-ks:0.274790
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,463 INFO | 5| ::	[3]	train-ks:0.255716	[3]	test-ks:0.305324
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,469 INFO | 12| ::	[3]	train-ks:0.247592	[3]	test-ks:0.270449
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,469 INFO | 17| ::	[3]	train-ks:0.249292	[3]	test-ks:0.270699
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,469 INFO | 8| ::	[3]	train-ks:0.263230	[3]	test-ks:0.252496
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,470 INFO | 16| ::	[3]	train-ks:0.268689	[3]	test-ks:0.266131
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,470 INFO | 4| ::	[3]	train-ks:0.261413	[3]	test-ks:0.260258
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,470 INFO | 9| ::	[3]	train-ks:0.268698	[3]	test-ks:0.255754
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,471 INFO | 14| ::	[3]	train-ks:0.261425	[3]	test-ks:0.241839
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,471 INFO | 2| ::	[3]	train-ks:0.265987	[3]	test-ks:0.253326
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,471 INFO | 7| ::	[3]	train-ks:0.263494	[3]	test-ks:0.260136
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,471 INFO | 6| ::	[3]	train-ks:0.264653	[3]	test-ks:0.256068
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,472 INFO | 0| ::	[3]	train-ks:0.264093	[3]	test-ks:0.251171
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,472 INFO | 15| ::	[3]	train-ks:0.265934	[3]	test-ks:0.260121
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,472 INFO | 19| ::	[3]	train-ks:0.261076	[3]	test-ks:0.256076
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,472 INFO | 13| ::	[3]	train-ks:0.266548	[3]	test-ks:0.280569
18/08/25 10:46:58 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:58,473 INFO | 18| ::	[3]	train-ks:0.263675	[3]	test-ks:0.255474
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,581 INFO | 2| ::	[4]	train-ks:0.267461	[4]	test-ks:0.263209
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,582 INFO | 5| ::	[4]	train-ks:0.259083	[4]	test-ks:0.302731
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,583 INFO | 10| ::	[4]	train-ks:0.270736	[4]	test-ks:0.263437
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,584 INFO | 9| ::	[4]	train-ks:0.270957	[4]	test-ks:0.263821
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,584 INFO | 14| ::	[4]	train-ks:0.267541	[4]	test-ks:0.249762
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,585 INFO | 6| ::	[4]	train-ks:0.266413	[4]	test-ks:0.263801
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,585 INFO | 1| ::	[4]	train-ks:0.265695	[4]	test-ks:0.268344
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,585 INFO | 0| ::	[4]	train-ks:0.270916	[4]	test-ks:0.252790
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,585 INFO | 8| ::	[4]	train-ks:0.266864	[4]	test-ks:0.256972
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,586 INFO | 3| ::	[4]	train-ks:0.265028	[4]	test-ks:0.274010
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,586 INFO | 11| ::	[4]	train-ks:0.261031	[4]	test-ks:0.242517
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,586 INFO | 4| ::	[4]	train-ks:0.265362	[4]	test-ks:0.269869
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,586 INFO | 7| ::	[4]	train-ks:0.270551	[4]	test-ks:0.270879
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,587 INFO | 15| ::	[4]	train-ks:0.269797	[4]	test-ks:0.265940
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,587 INFO | 12| ::	[4]	train-ks:0.251769	[4]	test-ks:0.265825
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,587 INFO | 13| ::	[4]	train-ks:0.270275	[4]	test-ks:0.278121
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,587 INFO | 16| ::	[4]	train-ks:0.277258	[4]	test-ks:0.273662
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,588 INFO | 19| ::	[4]	train-ks:0.264914	[4]	test-ks:0.263789
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,588 INFO | 17| ::	[4]	train-ks:0.253500	[4]	test-ks:0.277332
18/08/25 10:46:59 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:46:59,592 INFO | 18| ::	[4]	train-ks:0.265297	[4]	test-ks:0.255897
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,727 INFO | 0| ::	[5]	train-ks:0.272829	[5]	test-ks:0.257225
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,728 INFO | 2| ::	[5]	train-ks:0.271210	[5]	test-ks:0.270738
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,728 INFO | 10| ::	[5]	train-ks:0.273149	[5]	test-ks:0.264937
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,728 INFO | 5| ::	[5]	train-ks:0.260996	[5]	test-ks:0.305599
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,728 INFO | 9| ::	[5]	train-ks:0.273260	[5]	test-ks:0.265579
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,729 INFO | 15| ::	[5]	train-ks:0.273003	[5]	test-ks:0.267115
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,729 INFO | 12| ::	[5]	train-ks:0.254419	[5]	test-ks:0.267850
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,729 INFO | 3| ::	[5]	train-ks:0.268860	[5]	test-ks:0.278649
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,730 INFO | 17| ::	[5]	train-ks:0.257118	[5]	test-ks:0.276053
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,730 INFO | 7| ::	[5]	train-ks:0.270454	[5]	test-ks:0.281948
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,730 INFO | 19| ::	[5]	train-ks:0.266555	[5]	test-ks:0.264425
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,731 INFO | 4| ::	[5]	train-ks:0.266903	[5]	test-ks:0.269251
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,731 INFO | 1| ::	[5]	train-ks:0.267439	[5]	test-ks:0.272839
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,731 INFO | 11| ::	[5]	train-ks:0.264046	[5]	test-ks:0.247816
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,731 INFO | 14| ::	[5]	train-ks:0.268987	[5]	test-ks:0.252603
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,732 INFO | 8| ::	[5]	train-ks:0.268534	[5]	test-ks:0.257021
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,732 INFO | 18| ::	[5]	train-ks:0.268315	[5]	test-ks:0.267191
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,732 INFO | 16| ::	[5]	train-ks:0.279828	[5]	test-ks:0.270923
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,732 INFO | 6| ::	[5]	train-ks:0.269077	[5]	test-ks:0.273271
18/08/25 10:47:00 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:00,733 INFO | 13| ::	[5]	train-ks:0.272555	[5]	test-ks:0.285220
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,863 INFO | 6| ::	[6]	train-ks:0.273417	[6]	test-ks:0.270661
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,863 INFO | 0| ::	[6]	train-ks:0.277123	[6]	test-ks:0.264478
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,863 INFO | 2| ::	[6]	train-ks:0.273731	[6]	test-ks:0.273107
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,864 INFO | 7| ::	[6]	train-ks:0.275987	[6]	test-ks:0.275554
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,864 INFO | 9| ::	[6]	train-ks:0.278053	[6]	test-ks:0.271328
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,864 INFO | 11| ::	[6]	train-ks:0.266321	[6]	test-ks:0.250843
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,865 INFO | 3| ::	[6]	train-ks:0.271265	[6]	test-ks:0.281224
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,865 INFO | 16| ::	[6]	train-ks:0.279892	[6]	test-ks:0.272915
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,865 INFO | 14| ::	[6]	train-ks:0.273903	[6]	test-ks:0.252065
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,866 INFO | 1| ::	[6]	train-ks:0.269724	[6]	test-ks:0.284952
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,866 INFO | 18| ::	[6]	train-ks:0.271505	[6]	test-ks:0.266881
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,866 INFO | 10| ::	[6]	train-ks:0.277666	[6]	test-ks:0.271212
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,866 INFO | 8| ::	[6]	train-ks:0.272944	[6]	test-ks:0.258046
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,867 INFO | 13| ::	[6]	train-ks:0.275404	[6]	test-ks:0.290394
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,867 INFO | 17| ::	[6]	train-ks:0.262216	[6]	test-ks:0.272610
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,867 INFO | 19| ::	[6]	train-ks:0.269798	[6]	test-ks:0.263180
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,867 INFO | 12| ::	[6]	train-ks:0.256750	[6]	test-ks:0.272028
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,868 INFO | 15| ::	[6]	train-ks:0.274870	[6]	test-ks:0.272556
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,868 INFO | 5| ::	[6]	train-ks:0.264574	[6]	test-ks:0.311568
18/08/25 10:47:01 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:01,868 INFO | 4| ::	[6]	train-ks:0.268094	[6]	test-ks:0.277722
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,061 INFO | 0| ::	[7]	train-ks:0.282139	[7]	test-ks:0.267584
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,061 INFO | 4| ::	[7]	train-ks:0.273064	[7]	test-ks:0.276699
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,061 INFO | 6| ::	[7]	train-ks:0.272757	[7]	test-ks:0.267651
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,062 INFO | 7| ::	[7]	train-ks:0.280499	[7]	test-ks:0.280445
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,062 INFO | 5| ::	[7]	train-ks:0.267489	[7]	test-ks:0.304914
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,062 INFO | 11| ::	[7]	train-ks:0.271020	[7]	test-ks:0.258090
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,063 INFO | 17| ::	[7]	train-ks:0.266969	[7]	test-ks:0.281476
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,063 INFO | 2| ::	[7]	train-ks:0.276945	[7]	test-ks:0.272697
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,063 INFO | 14| ::	[7]	train-ks:0.276267	[7]	test-ks:0.259209
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,063 INFO | 1| ::	[7]	train-ks:0.274062	[7]	test-ks:0.287360
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,064 INFO | 18| ::	[7]	train-ks:0.278474	[7]	test-ks:0.265676
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,064 INFO | 3| ::	[7]	train-ks:0.274288	[7]	test-ks:0.288637
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,064 INFO | 9| ::	[7]	train-ks:0.281730	[7]	test-ks:0.274905
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,064 INFO | 10| ::	[7]	train-ks:0.280972	[7]	test-ks:0.272569
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,065 INFO | 8| ::	[7]	train-ks:0.278896	[7]	test-ks:0.258591
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,065 INFO | 12| ::	[7]	train-ks:0.262557	[7]	test-ks:0.276044
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,065 INFO | 15| ::	[7]	train-ks:0.281830	[7]	test-ks:0.279164
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,066 INFO | 16| ::	[7]	train-ks:0.282905	[7]	test-ks:0.273821
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,066 INFO | 19| ::	[7]	train-ks:0.272972	[7]	test-ks:0.264290
18/08/25 10:47:03 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:03,066 INFO | 13| ::	[7]	train-ks:0.277921	[7]	test-ks:0.298307
18/08/25 10:47:04 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:04,200 INFO | 2| ::	[8]	train-ks:0.280560	[8]	test-ks:0.276503
18/08/25 10:47:04 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:04,201 INFO | 0| ::	[8]	train-ks:0.284726	[8]	test-ks:0.275615
18/08/25 10:47:04 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:04,201 INFO | 6| ::	[8]	train-ks:0.275541	[8]	test-ks:0.271072
18/08/25 10:47:04 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:04,201 INFO | 4| ::	[8]	train-ks:0.274850	[8]	test-ks:0.277350
18/08/25 10:47:04 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:04,202 INFO | 11| ::	[8]	train-ks:0.273702	[8]	test-ks:0.261809
18/08/25 10:47:04 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:04,202 INFO | 9| ::	[8]	train-ks:0.286633	[8]	test-ks:0.272326
18/08/25 10:47:04 INFO RabitTracker$TrackerProcessLogger: 2018-08-25 10:47:04,202 INFO | 12| ::	[8]	train-ks:0.264409	[8]	test-ks:0.276489

@CodingCat
Member

It looks like the custom metric is not synced properly in XGBoost. I need to find time to look into this.

Is turning off early stopping an option for you?
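
To make the divergence concrete, here is a minimal sketch that takes the test-ks values two workers printed in the log above (rounds 5 to 7) and applies a simple "did the last round improve?" check to each. The helper `improved` and the object name are hypothetical, and the check only assumes that a larger KS is better. It is not XGBoost's actual early-stopping code, just an illustration of how per-worker metrics can lead workers to different decisions.

```scala
object MetricDivergence {
  // test-ks for rounds 5, 6, 7, copied from the log above for workers 0 and 5.
  val testKs: Map[Int, Seq[Double]] = Map(
    0 -> Seq(0.257225, 0.264478, 0.267584),
    5 -> Seq(0.305599, 0.311568, 0.304914)
  )

  // Hypothetical helper: true if the latest value beats the best earlier value.
  // Assumes a larger metric is better and that history has at least two entries.
  def improved(history: Seq[Double]): Boolean =
    history.last > history.init.max

  def main(args: Array[String]): Unit = {
    testKs.toSeq.sortBy(_._1).foreach { case (worker, history) =>
      println(s"worker $worker improved at round 7: ${improved(history)}")
    }
  }
}
```

With these values worker 0 reports an improvement at round 7 while worker 5 does not, so any stopping rule evaluated locally on each worker could disagree between workers.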

@wsobra
Author

wsobra commented Aug 26, 2018

@CodingCat, thank you for your help. Unfortunately, I need early stopping turned on so that I can optimize the parameters against specific metrics and avoid overfitting.

@CodingCat
Member

@wsobra As it will take me some time to debug and fix this, can you use a paramGrid in MLlib to search for the best value of numRounds in the meantime?
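
A rough sketch of that workaround, under several assumptions: it uses the `XGBoostClassifier` API from XGBoost4J-Spark 0.8x (older releases expose a different estimator class), the column names `features` and `label` are placeholders, the candidate round counts are arbitrary, and the evaluator is Spark's built-in `BinaryClassificationEvaluator` rather than the custom KS metric from this issue. The function name `tuneNumRound` is hypothetical.

```scala
import ml.dmlc.xgboost4j.scala.spark.XGBoostClassifier
import org.apache.spark.ml.evaluation.BinaryClassificationEvaluator
import org.apache.spark.ml.tuning.{CrossValidator, CrossValidatorModel, ParamGridBuilder}
import org.apache.spark.sql.DataFrame

// Hypothetical helper: pick the number of boosting rounds by cross-validation
// instead of early stopping. `train` is assumed to have "features" and "label" columns.
def tuneNumRound(train: DataFrame): CrossValidatorModel = {
  val xgb = new XGBoostClassifier(Map("objective" -> "binary:logistic"))
    .setFeaturesCol("features")
    .setLabelCol("label")

  // Candidate round counts; the values here are placeholders.
  val paramGrid = new ParamGridBuilder()
    .addGrid(xgb.numRound, Array(50, 100, 200))
    .build()

  val cv = new CrossValidator()
    .setEstimator(xgb)
    .setEvaluator(new BinaryClassificationEvaluator().setLabelCol("label"))
    .setEstimatorParamMaps(paramGrid)
    .setNumFolds(3)

  cv.fit(train)
}
```

This trains one model per grid entry and fold, so it costs more than early stopping, but it does not rely on the evaluation metric being synced between workers during training.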

@hcho3
Collaborator

hcho3 commented Mar 13, 2019

@CodingCat Any updates on custom evaluation on XGBoost4J-Spark?

@CodingCat
Member

Next release.
