Monarch geneset OGS2.0

DPOGS200602
Transcript: DPOGS200602-TA, 1389 bp
Protein: DPOGS200602-PA, 462 aa
Genomic position: DPSCF300076 - 463513-496941
RNAseq coverage: 659x (Rank: top 19%)
Annotation
Heliconius: HMEL001009; 2e-157; 87.01%
Bombyx: BGIBMGA011292-TA; 1e-68; 96.03%
Drosophila: bves-PB; 8e-82; 63.93%
EBI UniRef50: UniRef50_E2B1Q1; 7e-91; 68.56%; Blood vessel epicardial substance n=12 Tax=Endopterygota RepID=E2B1Q1_CAMFO
NCBI RefSeq: XP_001606395.1; 3e-93; 60.27%; PREDICTED: similar to ENSANGP00000020411 [Nasonia vitripennis]
NCBI nr blastp: gi|156548872; 6e-92; 60.27%; PREDICTED: blood vessel epicardial substance-like isoform 1 [Nasonia vitripennis]
NCBI nr blastx: gi|156548872; 8e-90; 60.48%; PREDICTED: blood vessel epicardial substance-like isoform 1 [Nasonia vitripennis]
Group
Gene Ontology: GO:0016020; 1.4e-126; membrane
KEGG pathway:
InterPro domain: [72-293] IPR006916; 1.4e-126; Popeye protein
Orthology group: MCL16558; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200602-TA
ATGTTGTTTCTCAGTGCCTTAGCGATCGGGATGATTACTGGTTTACAGAATGTCTCAGAGGTGCCCACGACGGAGAGTGACGTAGAGTCAACCACAGACATGGGGTTGTTGGAGGCTACATCGAGGGCTCCCACCCCTAATGATTTTGTGAATGTTACAATACAAGAGAGACCCTCCATGTGGGATGTGTACTTAAATCACTGCCCGAAGTGGCGTCCCATTAACCATATATTTTTTCAGACGGCCAATGTGTTCTTTTTATTGTCGTTCCTTGCACCGCACACTCCAAGTGGTCTTATATGGTTACGAGTAGCCCTTATCTTGGGGTGCTCATTCTCAGGATTGTGGGCTTGGAGTGTTGAGTGTTACCTGGACGCTGTTGTGTGGAACTGTGTGTTTATTGTCATCAACTTTGTATACTTCTCCGTGCAGTTCTACTTATTGAGGCCAATAAAATTCCATAAGGATATTGAGGAGGTTTACCTGGCGTTGTTCAAGCCGCTGCGAGTGTCCCGCCTTCAGTTCCGTCGCGTGTTGATGTGTATGCGCAACGTGAGACAACTCAAGTGTCACGAACTGTACGCCCACGAGAAGGTCACCAAGGTGGACAGTCTCTCGCTGGTGCTCTCCGGAAAGTTGGTGGTATCTCAGAACCAGCGAGCCCTGCATATCGTGTTCCCACACCACTTCCTGGACTCTCCCGAGTGGTTTGGGGTTTCGACGGATGAGTTCTTCCAGGTGTCTATAATGGCGATGGAGGAGTCTCGTGTGTTGGTTTGGCATCGCGACAAGTTGAAGCTGTCCATCATGTCGGACGGATTCTTGCAGGCTGTGTTCGATCACATTCTAGGGAGGGACGTCGTTCATAAACTCATGCAGTTTGCAGCTAGTTTGACGTTATTAAACGCGGGGTTACATGAACCGTTGACCTTCAAGAGGTTCAGGTTGGTGGTATCTCAGAACCAGCGAGCCCTGCATATCGTGTTCCCACACCACTTCCTGGACTCTCCCGAGTGGTTTGGGGTTTCGACGGATGAGTTCTTCCAGGTGTCTATAATGGCGATGGAGGAGTCTCGTGTGTTGGTTTGGCATCGCGACAAGTTGAAGCTGTCCATCATGTCGGACGGATTCTTGCAGGCTGTGTTCGATCACATTCTAGGGAGGGACGTCGTTCATAAACTCATGCAGGTGAGCGAAACAATGTCTGTCAGTAACGGTCACTTACCTAACAGCTATGAGGAGGGAGAGGATAAACCCATGCTGGTTGTGAAGAAAGCTGGTGATGGACCTGGAATCACCGCGTTATTGAACAGACAGCTGCAAGCGACAGATCCGAACGCGTGGCGTTTAGGACGAATCGACGAAACAGACCACGAAACACCCGTTTGA

Protein sequence:

>DPOGS200602-PA
MLFLSALAIGMITGLQNVSEVPTTESDVESTTDMGLLEATSRAPTPNDFVNVTIQERPSMWDVYLNHCPKWRPINHIFFQTANVFFLLSFLAPHTPSGLIWLRVALILGCSFSGLWAWSVECYLDAVVWNCVFIVINFVYFSVQFYLLRPIKFHKDIEEVYLALFKPLRVSRLQFRRVLMCMRNVRQLKCHELYAHEKVTKVDSLSLVLSGKLVVSQNQRALHIVFPHHFLDSPEWFGVSTDEFFQVSIMAMEESRVLVWHRDKLKLSIMSDGFLQAVFDHILGRDVVHKLMQFAASLTLLNAGLHEPLTFKRFRLVVSQNQRALHIVFPHHFLDSPEWFGVSTDEFFQVSIMAMEESRVLVWHRDKLKLSIMSDGFLQAVFDHILGRDVVHKLMQVSETMSVSNGHLPNSYEEGEDKPMLVVKKAGDGPGITALLNRQLQATDPNAWRLGRIDETDHETPV-
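
Consistency check (sketch):

The record lists a 1389 bp transcript and a 462 aa protein, and the protein sequence ends in "-" where the transcript ends in the stop codon TGA. A minimal Python sketch of how these can be cross-checked is given below. It assumes the transcript is a complete coding sequence read in frame from the first base and translated with the standard genetic code; the function names `translate` and `lengths_consistent` are hypothetical helpers for illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stops are written as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript length equals the protein codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First and last 18 bases of DPOGS200602-TA, copied from the record above.
print(translate("ATGTTGTTTCTCAGTGCC"))   # MLFLSA
print(translate("CACGAAACACCCGTTTGA"))   # HETPV-
print(lengths_consistent(1389, 462))     # True
```

Passing the full DPOGS200602-TA sequence to `translate` in place of the short excerpts would let the whole protein be compared against DPOGS200602-PA.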