Monarch geneset OGS2.0

DPOGS211332
Transcript: DPOGS211332-TA, 2850 bp
Protein: DPOGS211332-PA, 949 aa
Genomic position: DPSCF300125 + 340772-350745
RNAseq coverage: 72x (Rank: top 66%)
Annotation
Heliconius: HMEL005289  5e-153  58.76%
Bombyx: BGIBMGA005166-TA  3e-126  52.83%
Drosophila: (no entry)
EBI UniRef50: UniRef50_UPI0002247837  2e-12  38.30%  UPI0002247837 related cluster n=1 Tax=unknown RepID=UPI0002247837
NCBI RefSeq: XP_001656570.1  2e-12  27.10%  hypothetical protein AaeL_AAEL013285 [Aedes aegypti]
NCBI nr blastp: gi|350422310  4e-12  42.68%  PREDICTED: hypothetical protein LOC100740110 [Bombus impatiens]
NCBI nr blastx: gi|328784957  4e-21  32.61%  PREDICTED: hypothetical protein LOC725795 [Apis mellifera]
Group
KEGG pathway: (no entry)
Orthology group: MCL25107  Lepidoptera specific

Nucleotide sequence:

>DPOGS211332-TA
ATGGCAGTTGAGAGTGTCTCAGGGGGCTGCAGTAGTGAAGTAAATATGTTCATCCAAGATAGTTCTGGTGATGAACAGTCATATATATCACATATTAGTGATAGCTTCTTCCATACCAAATTTAAAATAGATCCAAGTGAATTATACACTTTTCACGATTCCGATGTCATTGCTGGAGAGATTACTGTAAGTCATACAGATGATAATTACCTTTTTCCGGAGAGCTCAGAATACAAATCAAAGTTCTTGGGGAGTTCATTAAAGGAAATTGATTTATTGGGTTCCTTAACTAATGGTGAGGTAAATAATGATAAAAAACACACTGTGACAAATGGTCACTATTCTAAAATAATAGATGATATTGCCGTAAATGAAACAGAATCAAAACCGATCCCCAAAGGTCAAAAACCGAGGAATTCTGTACAGAGTCATGACAAAGTAAAACATAACATCTCCAGCCCTACAGTTTTTAACAATACAAGTCTATCTGCAAACAATAGTAAGACAGTTGTGAGTAACGGCAAGAATATTGAAAAGACTGCCCCTGCAATATCTCCGTCTGAAACATTTTTAGATGTATTTAAAAGAGAACAAGGTGTGACAGAGAATACTACAGTTAAAAATGAGCCAACCCTAACACCAATTAAACCACCGATACCAATAGCCAAAAAGCAGACTTCTGGTAAAACCAAATCCGTAACTCCATCCAAGCCCCGTAAAGGTCGAGGGCCTACGATTCATGAAGCACTGCAACGGATACCATTGCAACGGAATGCTGTTGCCTTAAAAAATACAAGATGGCAGGCGCCCGGGGAGGAAATGTTCCAATGTGGACCTCTGAGTGAAAAAAAAGTTAATGCCTTCCGACAACTGGACACGAGCAGTAGTGATGATGATAATATGCCAGAATTTGAAATTGGTGTGGGTCCGGTATGTGTGGAGGAGAGTGCGGCGGAGTCCCGCGGGGCCCGTCTGGCGCTCCGCAGCGCGGCCATGAGAAGACACGTGGTCAGGGCTGCCACGAGCCTGAAACTAAGCAAGGATCACAGCGAGCTCAGGGCTCTGACTTCAATGATCAAGAAACAATTGAGTGGCCAGAGCTCTACATGTCGTCTTCCGCCACAAACCTTGAACTATTTACTGGCTAAGCGCGGTCTCTCTGATATACTGAGCCAACAGGAACGCAGGTGGCTTAGACAAAGCGGCTGGTCAGGCGGTGAGAGTATTAAATCTGAATCATCGACGTGCGCCGAGGAGAGGTGCACTCAACCCCCACTACCGTCGAGTCGGTATTGTCTATCACATGTCACTCTAGCGCCCGAACAGCGGCTTTACGCTGCTTGCGCAGCTGTGTTCGCTGGCGGTGAGCGCTGCAAACAACCGCTTTTACCGCTGCAGGAACAGACGCCGTTATGCACAGAACACGCTTGGAAAAGGGATAACTACGAGCTTCTGAGTCGTGAAAGCAAGCCTAAGTCTGTGCGTAAGCGAGTTTGGTCTGTGAGACCGGCTCGTCCGCCTCGCCCACAACGCCGACCAAAAAGACGCAGGCGGGCTCCCGTTAAAAATACCGCAGAACAATCGCTGTCCGCCAAATCCGTAACTCCATCCAAGCCCCGTAAAGGTCGTGGGCCTACGATTCATGAAGCACTGCAACGGATACCATTGCAACGGAACGCTGTTGCCTTAAAAAATACAAGATGGCAGGCGCCCGGGGAGGAAATGTTCCAATGTGGACCTCTGAGTGAAAAAAAAGTTAATGCCTTCCGACAACTGGACACGAGCAGTAGTGATGATGATAATATGCCAGAATTTGAAATTGGTGTGGGTCCGGTATGTGTGGAGGAGAGTGCGGCGGAGTCCCGCGGGGCCCGTCTGGCGCTCCGCAGCGCGGCCATGAGAAGACACGTGGTCAGGGCTGCCACGAGCCTGAAACTAAGCAAGGATCACAGCGAGCTCAGGGCTCTGACTTCAATGATCAAGAAACAATTGAGTGGCCA
GAGCTCTACATGTCGTCTTCCGCCACAAACCTTGAACTATTTACTGGCTAAGCGCGGTCTCTCTGATATACTGAGCCAACAGGAACGCAGGTGGCTTAGACAAAGCGGCTGGTCAGGCGGTGAGAGTATTAAATCTGAATCATCGACGTGCGCCGAGGAGAGGTGCACTCAACCCCCACTACCGTCGAGTCGGTATTGTCTATCACATGTCACTCTAGCGCCCGAACAGCGGCTTTACGCTGCTTGCGCAGCTGTGTTCGCTGGCGGTGAGCGCTGCAAGCAGCCGCTTTTACCGCTGCAGGAACAGACGCCGTTATGCACAGAACACGCTTGGAAAAGGGATAACTACGAGCTTCTGAGTCGTGAAAGCAAGCCTAAGTCTGTGCGTAAGCGAGTTTGGTCTGTGAGACCGGCTCGTCCGCCTCGCCCACAACGCCGACCAAAAAGACGCAGGCGGGCTCCCGTTAAAAACACCGCAGAGCAATCGCTGTCCGAAATCAATGTGTGTTCCAATTCATCTACATATGATAGCTCAGAGGATACGGCTATGGGAGGGCTCAGCGAGAGCGAATACATGACTGCTAGCGCTTCCCATGAACTCGAGGTCGGTCAAGTTCCGCCGGATGATATACTGGATCCGTCTGTCCTAAGTCAGATTCCTGACGAAGAGTTCACTGAGTTTTTCAATCAAGCGGAGAGTGGAGCTTCGTTCGTGGAAGGTTCAGAGCTGGTTGCAGCGCTGGAGGCTGTGTTGGACGACAGACCGATTGATATAGATGCCCTGGCTACTGGACGAAAGATACCAATACACACCGCAGTCAGTATGGAATCATCGAGCACGCCTTCATAA

Protein sequence:

>DPOGS211332-PA
MAVESVSGGCSSEVNMFIQDSSGDEQSYISHISDSFFHTKFKIDPSELYTFHDSDVIAGEITVSHTDDNYLFPESSEYKSKFLGSSLKEIDLLGSLTNGEVNNDKKHTVTNGHYSKIIDDIAVNETESKPIPKGQKPRNSVQSHDKVKHNISSPTVFNNTSLSANNSKTVVSNGKNIEKTAPAISPSETFLDVFKREQGVTENTTVKNEPTLTPIKPPIPIAKKQTSGKTKSVTPSKPRKGRGPTIHEALQRIPLQRNAVALKNTRWQAPGEEMFQCGPLSEKKVNAFRQLDTSSSDDDNMPEFEIGVGPVCVEESAAESRGARLALRSAAMRRHVVRAATSLKLSKDHSELRALTSMIKKQLSGQSSTCRLPPQTLNYLLAKRGLSDILSQQERRWLRQSGWSGGESIKSESSTCAEERCTQPPLPSSRYCLSHVTLAPEQRLYAACAAVFAGGERCKQPLLPLQEQTPLCTEHAWKRDNYELLSRESKPKSVRKRVWSVRPARPPRPQRRPKRRRRAPVKNTAEQSLSAKSVTPSKPRKGRGPTIHEALQRIPLQRNAVALKNTRWQAPGEEMFQCGPLSEKKVNAFRQLDTSSSDDDNMPEFEIGVGPVCVEESAAESRGARLALRSAAMRRHVVRAATSLKLSKDHSELRALTSMIKKQLSGQSSTCRLPPQTLNYLLAKRGLSDILSQQERRWLRQSGWSGGESIKSESSTCAEERCTQPPLPSSRYCLSHVTLAPEQRLYAACAAVFAGGERCKQPLLPLQEQTPLCTEHAWKRDNYELLSRESKPKSVRKRVWSVRPARPPRPQRRPKRRRRAPVKNTAEQSLSEINVCSNSSTYDSSEDTAMGGLSESEYMTASASHELEVGQVPPDDILDPSVLSQIPDEEFTEFFNQAESGASFVEGSELVAALEAVLDDRPIDIDALATGRKIPIHTAVSMESSSTPS-