1
因此,我對編程非常陌生,對任何編程語言都不是很熟練。我購買了一本關於生物學家編程的書,我已經摸索了一些東西。我想要:從文件中獲取序列並從中找到並提取可變區域。下面我的代碼:DNA序列操作
**
#!/usr/bin/python
#for extracting GAA sequences
import os
import sys
import re
#opens sequence file and defines it as reps
reps = open('142sequences.txt')
#defining what to read
line = reps.readlines()
#defines what we are looking for in rep lines
for line in reps:
sear = re.search(r"C[A]{2,}G[ATCG]{17, 2700}AAT[A]{2,4}G[A]{2,}", reps)
if sear:
repeats = sear.group()
print(repeats)
else:
print('Not Recognized')
** 我得到什麼回報。請幫助
謝謝!還在搞清楚 –