Thanks Malcolm,
You are quite right - some logic was blocking the init function and caused a segmentation fault.
Not directly related:
Is there currently any problem with adding a second GPU?
I plan to let TD use one GPU (device 0),
while the external CUDA function work runs on the other GPU (device 1).