Monarch geneset OGS2.0

DPOGS204751
TranscriptDPOGS204751-TA2109 bp
ProteinDPOGS204751-PA702 aa
Genomic positionDPSCF300231 - 212844-221326
RNAseq coverage37x (Rank: top 73%)
Annotation
HeliconiusHMEL0114721e-1337.04% 
Bombyx% 
Drosophila% 
EBI UniRef50UniRef50_O014181e-1432.17%Gag protein n=7 Tax=Endopterygota RepID=O01418_BOMMO
NCBI RefSeq%
NCBI nr blastpgi|20552754e-1432.17%Gag protein [Bombyx mori]
NCBI nr blastxgi|20552754e-2528.81%Gag protein [Bombyx mori]
Group
KEGG pathway 
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204751-TA
ATGGGTTGCGACGCGGAAACCTCCTCGAGCGCCTGGTTGTTACCGGTCCCTCACGGATTACGGTCCGGGGGGAAGGGTAGCAGCTGGGCCTGGTGCAATCTGGGCAACGACGGAGAACGATGTTTGCATAGCAGGCGTCTTTCTTCGGCGGCTAATATCCGCAGGGGGGGGATAGTCTCATGGTCCGGATGCGTGGGTTTCGTTTTCGGAGCCTACGCAATTGGTTCGTGGGACTCTAGGGGTGGCGCGGTCGTGTGGCCGTATGGCCTGGCCGGTGAGTGCGGGCCGGCCTTGTGGTCGTTAGACTCTGGGCTGGCCGGCGTGTGGGTCGGAGAGGGGCCCGAGGACCCCCATCCGTGGGCTAGGGCTCCTTCTCGGAATCTGTGCGAAGGGGCTGGCGTGCCGGTCTCGGAACTCTGGGGACGCAGGACACTCGGCGGTAATACCACTCGTCAGCTCCTCTGCAGCAGGGGACTCCGGCGACGTGGTATCCAAGAGTGTAGTGGTGGTGAGGGCGTGCCATTTTTGGCGTGCCGTTTTCTGGTTCCCCAGCTCTCCTGTATCAGGAGACCCCGGGGATACCGGATCTGTCACTTCGTGACGTTGCCGAGTGGATTCGTCTGCTCTCGGCACAACCCGCTCCCCAGCTCCCCTGTATCAGGGGACCCCGGGGACGTGGGATCGGGCGGCGGGCGGTCTTCTAGACTACTCGCCGGCCAGACCGTAACCCTCATTGAGAGGACACGGGGGCGTCAGCTGCGAGGAGTAAACCTCTATAAAAAATCCCCAAATTCTCCTAGTTGCGGGGCGCGGCTAGGGGATGCTTCTTCGGGAGCGGCTGGGGGGTATCTCAGCCACCGAGCGACCAAGGGGCTCCCGCCCCCGAGGCGGGACGACGATGACTCTCTGGGACGAGGTGCAGTGGGCACGGGACCTGGCTTGGGAAAAGGACACCGATCAGCAAAAGATTTGGACAAGGTGATGGTGGTAAGCTCTGACGAGGAGCCTGTAGACGCGTCGGCGGTGCGACCAGTGGCGCGCCAGCCACTAGCGTCCAACGGTGAGAAGGGAAGAGCAACCAGAAGGAGCCCGCGTACAACGAGCGGAAGTGAGATGGAGACGGAGGAGACGCGCTCCGCCTTCTTCGCCGGTACGCCGACGAGCCTGGCGCCTCTCCGGAAGCGGCCAGCGACAAGAAGACAACCAGGCGGGAGTTCGTCTGGCGGAAGTGACAAGGCTTCCTTCGCTACTGCGGTGAAGAGAGGACGGGCAGTTGAGGAAGGCGAATCGAACTCGGAGGAGGAAAACGTGGCGAGGTCGACGCGTCGGGTCGAGGTGGCCCTTTCCTCGGTTAAGACGCTGCCAGCCTCGTGCCTCGCGAAAGAGATGGAGAGGGCCCTGAGCGTCATAGTCGACGTGGCCCTCAAATCCAAGAATCTGAAGGGCGGATGCGTCAGGGCATTGAAGACGTCGGCGGCACTCCTGGGGGAGGCAAAAGAGATTCTCCTGCAGCGGACCAGCGGCGAGGAGAATGAGATTCTCCGAGCCCGGCTAGAGGAAGAAAGGAAGAAGAGCTCGCTGCTGGAGAAGGAGCTGGGGCTCCTGAGAGAGGGGCAGGCCCGCTTGCGGGCAGACATGGACCTGCTCGCCACTGCCCCGAAACCAGCACGAGACGAGAAGAGCGAGGAGGAGCTTCGCGGGTCCCTCATGAGGGACCTAGGTGCCATGATGGACGCGAAGCTCCAGGGGATCGCAGACCGGCTGCTCCCCGAAAAGCGCCTGAGGCCGCCCCTAGCGGCGGACAAGAGGCCACCCCCAGCGCCTGCGTCGGCTGCTGTGGCTGAGCCGGCAGGTAGAGTGGCGAGCAGGAAAAAGAACGGTGCCACAAGAGAACAGGAGAAGACGGCGAGACCTCTACCCCCGCCGCCTCCATCCATGGACAAGACATGGACGGAGGTGCGTCCACCTTCACCCGCCTCTGGGGGTGTGCGGCGGGCAGCCAGCGCGGGGGCGCTGCCGCCTGCATCACAGCGCACACCTCTAGCGATGGCCAGATACGGCCGTGCGCTCCGGCCACAACAAAGCCCTCCCCCACCCGGGGGGGCGTGA

Protein sequence:

>DPOGS204751-PA
MGCDAETSSSAWLLPVPHGLRSGGKGSSWAWCNLGNDGERCLHSRRLSSAANIRRGGIVSWSGCVGFVFGAYAIGSWDSRGGAVVWPYGLAGECGPALWSLDSGLAGVWVGEGPEDPHPWARAPSRNLCEGAGVPVSELWGRRTLGGNTTRQLLCSRGLRRRGIQECSGGEGVPFLACRFLVPQLSCIRRPRGYRICHFVTLPSGFVCSRHNPLPSSPVSGDPGDVGSGGGRSSRLLAGQTVTLIERTRGRQLRGVNLYKKSPNSPSCGARLGDASSGAAGGYLSHRATKGLPPPRRDDDDSLGRGAVGTGPGLGKGHRSAKDLDKVMVVSSDEEPVDASAVRPVARQPLASNGEKGRATRRSPRTTSGSEMETEETRSAFFAGTPTSLAPLRKRPATRRQPGGSSSGGSDKASFATAVKRGRAVEEGESNSEEENVARSTRRVEVALSSVKTLPASCLAKEMERALSVIVDVALKSKNLKGGCVRALKTSAALLGEAKEILLQRTSGEENEILRARLEEERKKSSLLEKELGLLREGQARLRADMDLLATAPKPARDEKSEEELRGSLMRDLGAMMDAKLQGIADRLLPEKRLRPPLAADKRPPPAPASAAVAEPAGRVASRKKNGATREQEKTARPLPPPPPSMDKTWTEVRPPSPASGGVRRAASAGALPPASQRTPLAMARYGRALRPQQSPPPPGGA-