Author: hardcover (I want to go wandering)
Board: RegExp
Title: [Question] Counting protein lengths
Time: Tue Aug 26 08:48:26 2008
I have a file that looks like this:
>sp|P15711|104K_THEPA 104 kDa microneme/rhoptry antigen OS=Theileria parva GN=TP04_0437 PE=2 SV=100
MKFLILLFNILCLFPVLAADNHGVGPQGASGVDPITFDINSNQTGPAFLTAVEMAGVKYL
QVQHGSNVNIHRLVEGNVVIWENASTPLYTGAIVTNNDGPYMAYVEVLGDPNLQFFIKSG
DAWVTLSEHEYLAKLQEIRQAVHIESVFSLNMAFQLENNKYEVETHAKNGANMVTFIPRN
GHICKMVYHKNVRIYKATGNDTVTSVVGFFRGLRLLLINVFSIDDNGMMSNRYFQHVDDK
>sp...
DAWVTLSEHEYLAKLQEIRQAVHIESVFSLNMAFQLENNKYEVETHAKNGANMVTFIPRN
...
>sp...
GHICKMVYHKNVRIYKATGNDTVTSVVGFFRGLRLLLINVFSIDDNGMMSNRYFQHVDDK
...
>sp...
FL...
...
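Each protein record starts with a >sp header line, and I want to count how many chars
are in the sequence between one header and the next. How should I write the expression?
thanks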
--
※ Origin: PTT (ptt.cc)
◆ From: 140.114.71.98
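1F: push PsMonkey: A regex by itself can't compute a length, can it? 08/26 09:29
2F:→ hardcover: Oh, I was originally hoping I could use some Linux utilities 08/27 10:48
3F:→ hardcover: to piece the answer together. In the end I still had to write a program 08/27 10:49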
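
For anyone landing on this thread later: the program the original poster ended up writing is not shown, so the following is only a minimal sketch of one way to do it in Python, not their actual solution. It assumes every record begins with a line starting with ">" and that all following lines up to the next ">" are sequence data. The function name protein_lengths and the short sample sequences are made up for illustration. It also assumes the "..." lines in the post are placeholders that do not occur in the real file.

```python
def protein_lengths(lines):
    """Yield (header, length) for each '>'-delimited record.

    `lines` is any iterable of strings, e.g. an open file object.
    Surrounding whitespace and newlines are not counted.
    """
    header = None   # header of the record currently being read
    length = 0      # residues counted so far for that record
    for raw in lines:
        line = raw.strip()
        if not line:
            continue                      # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, length      # finish the previous record
            header = line[1:]             # drop the leading '>'
            length = 0
        elif header is not None:
            length += len(line)           # sequence line: add its characters
    if header is not None:
        yield header, length              # last record has no following header


if __name__ == "__main__":
    # Tiny made-up sample in the same layout as the file in the post.
    sample = [
        ">sp|P00001|DEMO_A hypothetical record\n",
        "MKFLILLF\n",
        "NILC\n",
        ">sp|P00002|DEMO_B hypothetical record\n",
        "GHICK\n",
    ]
    for name, n in protein_lengths(sample):
        print(n, name)
```

To run it on a real file, pass an open file handle instead of the sample list, e.g. protein_lengths(open(path)) with your own path. As noted in 1F, a regular expression is only needed here to recognize the header lines (a plain startswith check is enough); the counting itself has to happen in the surrounding program.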