如何在其他行之前放置字符串“ title”,直到新的字符串“ title”。用awk,sed,perl等解决的递归问题

马蒂亚斯·加拉瓦利亚(Matias Garavaglia)

我有一个txt文件,其中包含超过一百万行的下一个内容(它是Match_n和“ cggggg”之间的标签):

Sequence_1
Match_1 cggggg
Match_2 gggggc
Match_3 ggggcc
Match_4 cgggcc
Match_5 agggca
Match_6 agggta
Sequence_2
Match_1 tgggca
Match_2 aggggg
Match_3 gggggc
Match_4 ggggca
Sequence_3
Match_1 cggggt
Match_2 ggggtt
Match_3 tgggga
Match_4 ggggac
Match_5 cggggc

我需要以下格式:

Sequence_1  Match_1 cggggg
Sequence_1  Match_2 gggggc
Sequence_1  Match_3 ggggcc
Sequence_1  Match_4 cgggcc
Sequence_1  Match_5 agggca
Sequence_1  Match_6 agggta
Sequence_2  Match_1 tgggca
Sequence_2  Match_2 aggggg
Sequence_2  Match_3 gggggc
Sequence_2  Match_4 ggggca
Sequence_3  Match_1 cggggt
Sequence_3  Match_2 ggggtt
Sequence_3  Match_3 tgggga
Sequence_3  Match_4 ggggac
Sequence_3  Match_5 cggggc

更多信息:有10.000个“ Sequence_N”,每个都有可变数量的“ Match_n cggggc”

谢谢!!

埃德·莫顿
awk -v OFS='\t' 'NF==1{seq=$0; next} {print seq, $0}' file

本文收集自互联网,转载请注明来源。

如有侵权,请联系 [email protected] 删除。

编辑于
0

我来说两句

0 条评论
登录 后参与评论

相关文章