Monarch geneset OGS2.0

DPOGS210507
Transcript: DPOGS210507-TA (1674 bp)
Protein: DPOGS210507-PA (557 aa)
Genomic position: DPSCF300186 - 1136-5406
RNAseq coverage: 2118x (Rank: top 6%)

Annotation
Heliconius:
Bombyx: BGIBMGA012593-TA, 9e-19, 56.63%
Drosophila: Bsg-PG, 9e-31, 39.73%
EBI UniRef50: UniRef50_UPI0000D566B5, 3e-31, 35.22%, UPI0000D566B5 related cluster n=1 Tax=unknown RepID=UPI0000D566B5
NCBI RefSeq: XP_966740.1, 6e-32, 35.22%, PREDICTED: similar to AGAP008408-PA [Tribolium castaneum]
NCBI nr blastp: gi|91085555, 1e-30, 35.22%, PREDICTED: similar to AGAP008408-PA [Tribolium castaneum]
NCBI nr blastx: gi|195472907, 2e-33, 40.00%, GE18733 [Drosophila yakuba]

Group:
KEGG pathway:
InterPro domain:
    [410-508] IPR013783, 2.1e-08, Immunoglobulin-like fold
    [414-509] IPR003599, 5.9e-08, Immunoglobulin subtype
    [413-508] IPR013098, 6.7e-07, Immunoglobulin I-set
Orthology group: MCL16576 Insect specific

Nucleotide sequence:

>DPOGS210507-TA
ATGGCACCGTGCGCAGAGAACGAGGCGGCGGCGGGGGAGGAGAGCCGAGTGACTGTGACGGCCGGGGCTCCGCTGACGGTGGAGTGTCGCCTGCGGCCCGAGCAGCGCGCCGAGTGGCGTAGGGACGGGGCCACGCCGCCGCCCGACTTGCGTGCGGCCCCCGAAGCGGCAGCAGGCTCGGGGCTGGCCGCTCGCCTGCAGGCCCCGGCCGCCCGCGATCACCACGCCGGCCTCTACACTTGCAGCCGGGAGCGCGACCACCGCGTGCGCGTCGTGGTGCTCCCCGGTACTGGTCCCGGTGCAGGCCGAGCATGTTTCTGGCTCCTTGCGGTGACCCTCCGTCACTCCGCCGTCACTCCGTCTGCTCGTCCTCTCTCGGAGTCGGCCGCCCCTGTGTCTGTGTCTCCCTCGGCCTCCGAGGCCGTCCCCGCCGCACCCGCCGGGCCCGTGTCTCCCGCCGCCGCCGGGGCCTCGGACCTCGCAGAGCTGTTGTACGACGTGCGCGGTAACCTGTCCCTGCACTGCTCGCTGCCCAACGAGAACAGCCTCGCATATGTCTGGACTAAGAACGGTACGGCCCTAGAGCAGGTGTGGGAGATGACAGGCCGCTACGTGCTAGAGAAGGGAGGAGCAGAGCTGAGACTGGCCCGGGCCCTGGAGGACGACTTCGGCAACTACACGTGTGGAGTCGCGGGAAGGAGCGAGACACAGGGCTGGGCGGTGCGCGGGCGACCGCATCTCAAGGTGCCCGCCAACACCAACGTGGTGGAGGGGCAGCGGCTCAAGCTCGTGTGCAAGGTTATTGGCAAACCTTACCGGCCTGTCAGTTGGTGGTACTCCAACTCCTCGGACGACGAAGGCAACTTCACGGAGGTGACGGCGGCGCTCGGGGCGCGCGCCGTAGTGGGCTCGGGCGAGGGCGGAGCGCCCGGCGCCGTGCTCACCGTGGAAGCGGCGGCGCGCTCCCCGCACCGTTATCTTTTCCTTATCACGGTAATCTCCGAGGACGCAGCGGCGGCCGCGATAGCGACACCGAGCAACAATCTCATGACGACTAAGAACGGTACGGCCCTAGAGCAGGTGTGGGAGATGACAGGCCGCTACGTGCTAGAGAAGGGAGGAGCAGAGCTGAGACTGGCCCGGGCCCTGGAGGACGACTTCGGCAACTACACGTGTGGAGTCGCGGGAAGGAGCGAGACACAGGGCTGGGCGGTGCGCGGGCGACCGCATCTCAAGGTGCCCGCCAACACCAACGTGGTGGAGGGGCAGCGGCTCAAGCTTGTGTGCAAGGTTATTGGCAAACCTTACCGGCCTGTCAGTTGGTGGTACTCCAACTCCTCGGACGACGAAGGCAACTTCACGGAGGTGACGGCGGCGCTCGGGGCGCGCGCCGTAGTGGGCTCGGGCGAGGGCGGAGCGCCCGGCGCCGTGCTCACCGTGGAGGCGGCGGCGCGCTCCGACGCCGGCCGCTACCGCTGCAGCGCGCCGGACGCCACACTGCCCGCCACCACCACGCTCCGGGTCAAGGACATGTACGCCGCCCTATGGCCCTTCCTCGGCATCTGCGCCGAGGTGTTCGTGCTCTGCGCCATCATCCTGGTATACGAGAAGAGACGCACCAAGCCCGAGCTCGACGACTCCGACACCGACAACCACGACCAGAAGAAGTCGTAA

Protein sequence:

>DPOGS210507-PA
MAPCAENEAAAGEESRVTVTAGAPLTVECRLRPEQRAEWRRDGATPPPDLRAAPEAAAGSGLAARLQAPAARDHHAGLYTCSRERDHRVRVVVLPGTGPGAGRACFWLLAVTLRHSAVTPSARPLSESAAPVSVSPSASEAVPAAPAGPVSPAAAGASDLAELLYDVRGNLSLHCSLPNENSLAYVWTKNGTALEQVWEMTGRYVLEKGGAELRLARALEDDFGNYTCGVAGRSETQGWAVRGRPHLKVPANTNVVEGQRLKLVCKVIGKPYRPVSWWYSNSSDDEGNFTEVTAALGARAVVGSGEGGAPGAVLTVEAAARSPHRYLFLITVISEDAAAAAIATPSNNLMTTKNGTALEQVWEMTGRYVLEKGGAELRLARALEDDFGNYTCGVAGRSETQGWAVRGRPHLKVPANTNVVEGQRLKLVCKVIGKPYRPVSWWYSNSSDDEGNFTEVTAALGARAVVGSGEGGAPGAVLTVEAAARSDAGRYRCSAPDATLPATTTLRVKDMYAALWPFLGICAEVFVLCAIILVYEKRRTKPELDDSDTDNHDQKKS-
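
The transcript and protein lengths listed in this record are consistent with a coding sequence that includes one stop codon: (557 + 1) x 3 = 1674. The following is a minimal sketch of that check in Python. The helper names `expected_cds_length` and `strip_stop` are hypothetical and not part of any database tooling, and the sketch assumes the trailing `-` in the protein sequence marks the stop codon.

```python
def expected_cds_length(protein_length: int, include_stop: bool = True) -> int:
    """Return the nucleotide length of a CDS encoding `protein_length` residues.

    Assumption: the CDS has no UTRs and, when `include_stop` is True,
    ends with exactly one stop codon.
    """
    codons = protein_length + (1 if include_stop else 0)
    return 3 * codons


def strip_stop(protein: str) -> str:
    """Drop a trailing stop marker ('-' or '*') from a protein string."""
    return protein.rstrip("-*")


# Values taken from the record above.
transcript_bp = 1674
protein_aa = 557

print(expected_cds_length(protein_aa) == transcript_bp)  # True
```

The same check can be run on the sequences themselves by comparing `len(nucleotide_sequence)` with `expected_cds_length(len(strip_stop(protein_sequence)))`.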