How to import a text file with two nanosecond-timestamp columns into a DolphinDB distributed table

Sample data from the text file is shown below; SendingTimeInNano and origSendingTimeInNano are nanosecond timestamps:

SendingTimeInNano#securityID#origSendingTimeInNano#bidSize
1579510735948574000#27522#1575277200049000000#1
1579510735948606000#27522#1575277200049000000#2
1579510735948649000#27702#1575277200049000000#3
1579510735948676000#27702#1575277200049000000#4
1579510735948711000#30495#1575277200049000000#5
1579510735948762000#29088#1575277200049000000#6
...

The DolphinDB distributed table is created with the script below; SendingTimeInNano and securityID are the two partitioning columns:

x=20*seq(500,2500);

dbSendingTimeInNano = database(, VALUE,  2020.01.11..2020.01.22);

dbSecurityIDRange = database(, RANGE, x);

db = database("dfs://dolphinDBHYTEST3", COMPO, [dbSendingTimeInNano, dbSecurityIDRange]);

nameCol = `SendingTimeInNano`securityID`origSendingTimeInNano`bidSize;

typeCol = [`NANOTIMESTAMP,`INT,`NANOTIMESTAMP,`INT];

schemaTb = table(1:0,nameCol,typeCol);

db = database("dfs://dolphinDBHYTEST3");

nx = db.createPartitionedTable(schemaTb, `nx, `SendingTimeInNano`securityID);

How can I efficiently import the text data into the DolphinDB distributed table above?

1 Answer

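You can try the code below. It passes a transform function to loadTextEx that converts the two timestamp columns to NANOTIMESTAMP before the rows are written to the table: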

// Convert both timestamp columns to NANOTIMESTAMP in place and return the table.
def dataTransform(mutable t){
    return t.replaceColumn!(`SendingTimeInNano, nanotimestamp(t.SendingTimeInNano)).replaceColumn!(`origSendingTimeInNano, nanotimestamp(t.origSendingTimeInNano))
}

pt=loadTextEx(dbHandle=db,tableName=`nx , partitionColumns=`SendingTimeInNano`securityID,filename="nx.txt",delimiter='#',transform=dataTransform);
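
Before running the full import, it may help to dry-run the transform on a small in-memory table. The sketch below is an illustration rather than part of the original answer: it assumes dataTransform has already been defined as above, and the table t is a hypothetical stand-in built from the first two sample rows.

```dolphindb
// Hypothetical in-memory table mirroring the first two sample rows.
// The trailing `l` marks the literals as LONG.
t = table([1579510735948574000l, 1579510735948606000l] as SendingTimeInNano, [27522, 27522] as securityID, [1575277200049000000l, 1575277200049000000l] as origSendingTimeInNano, [1, 2] as bidSize)

// Apply the same transform that is passed to loadTextEx; t is modified in place.
dataTransform(t)

// Inspect the column types; both time columns should now be NANOTIMESTAMP.
schema(t).colDefs
```

If the two columns still show up as LONG here, the transform is not being applied as intended and the import would likely fail to match the table schema.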

For further details, see https://github.com/dolphindb/...
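
Once the import finishes, a quick check along the following lines can confirm that the data landed. This is only a sketch, assuming the database path and table name are the ones used in the question's script; the counts you see depend on your file.

```dolphindb
// Reopen the distributed table by database path and table name.
nx = loadTable("dfs://dolphinDBHYTEST3", `nx)

// Total number of rows written.
select count(*) from nx

// Rows per calendar day of the partitioning column.
select count(*) from nx group by date(SendingTimeInNano)
```

If the total is lower than the number of data lines in the file, one thing worth checking is whether some SendingTimeInNano values fall outside the dates declared for the VALUE partition (2020.01.11..2020.01.22).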
