Monarch geneset OGS2.0

DPOGS213509
TranscriptDPOGS213509-TA1794 bp
ProteinDPOGS213509-PA597 aa
Genomic positionDPSCF300033 - 1014371-1027167
RNAseq coverage5x (Rank: top 88%)
Annotation
HeliconiusHMEL0077859e-13776.64% 
BombyxBGIBMGA011684-TA2e-7782.08% 
DrosophilaCG12502-PB1e-4879.17% 
EBI UniRef50UniRef50_B4N3B42e-5277.50%GK12496 n=1 Tax=Drosophila willistoni RepID=B4N3B4_DROWI
NCBI RefSeqXP_001984108.11e-5373.85%GH15186 [Drosophila grimshawi]
NCBI nr blastpgi|1950149392e-5273.85%GH15186 [Drosophila grimshawi]
NCBI nr blastxgi|2700149251e-5444.98%hypothetical protein TcasGA2_TC011532 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[215-336] IPR0187873.6e-31Protein of unknown function DUF2371, TMEM200
Orthology groupMCL25879 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213509-TA
ATGTGGTCGCACGTGCAGGCACAGGTCTCGATGTGCCTCCTCAGCTCCAGCTCGATCTCCCAGGGCGGCTCCATGAGCGCGCGCCGCGGCGTGCCCGCGCCTCCGCGCGCTGCTACTGCGGGCCCAGGCCGTGCGCGGGCCGCACCTGCGCGAGGCCACCTGCCGCCCCGCCTCCCGTGCGGTGCCCTTCAGCATGCAGCCCGCAGCGCTGGGGGCGCCGGCTGCGGTCGCGGCGCGAAGCTTCAGCTAGCCCGGGACGCAGGCCGCCGGGCGAGTGACCGCGTAGGGCACGAAGGTGAGCGCGACGACGCGCATAGCGATGCGGCCGGCTGCCGGCAGGGGAATGCCGCGGCCGGGTCGCTCGCCGAGAGCCCGGGCGGCCCGGCCGCGCCGGCGCCCCGAGCTCCACGTGACACTTCTCTCCACCCACCGCCCTCCGCGCAGAACGACGACCGTCCGGCGGATCTTTCGGATGCACAATCCTCGACTTCAATTTGCGCTGAACAGGTTCAACGATTGCACGGAGCTGTTCGATTTTGTTTGTTGAATCGGTCTGGATGGAGCAAGGAGGCAGGATGCGTGCGCACAGTCGCGACTCAGATGGCAACGAGAGCGGCAGTCGCTATGAGGCGAGCGCGGAGTGTCACAGGGCCCTTGCGGCGAGCCCACCCCGCATCCGGGGCCACCTGGAACGTCCAGGTGGTCAAGGGGAAAATGAGCTCTCAGTGTCTGTGGCACGCGTGCAGAGCGCTGTCCGCCGGCCTCTTGCTCATGCTGCTGGGAGCGGCCATGGCCGTTATAGGTTACTATGCCGATACGTTGTCAGTGGCGGAAGAAGTGCGTGGCAACTCCACTGTATCAGTGAAGGATGAGGCGAGGGGGTTTCATCTTAACAACCTGTCTTACGCCGGACCCATCGTGATGGGTTTTGGAGGTTTCATAGTAGTAGCAGCCTGTGTTATGACATTCGAGGCTCGGGATAGCGCGGCTAAAGTGACGCCCGCGCGACCCCAGACCATCCCGCGACCCCTGCCTCGCCGGGGTCCATGCGCCCCCGCCAGACTGGACACTCTCGGGGTGTACAGACTACCCCACGTTCTGCCGCTACCACACTCACTGCCACTCGCGCCAGTCAGGGTACGACCACACGTCCACAGAACCAGTGGAGATAAAGCTAGAAATCGTGCGCGTTTCGGCTCCGCTCCGGACTTGAGGAGCGGTAGCGGCCTCGCTACGCCGTCCGTAACCGCGCTACGACGACCGCTACGACGCTACGCTCTCTCTGTTGACGAACCGCCGCATTCAGCTGTCAGAACCCAGCACCATTATCTACACCCTAGCACTATAACCAAACCCAGCTCCCATTCCATATCCAGCGCTAGTGCGGTGGAGTCCGAGTGCGGGTCTCAGTCTTCTTTGGCATTGGATCTTCACGCAAGCGGGGCCTGTGCTGGCGTCACGCTGAGGGTCAGAGATAACACGAGAAGAAGACCCCTGGCAAGACAACAGAGACTCAACGAGGACACTATACACCCATCTGGAGAAAGCGCCGGCCATGTAGCCAACACACGTCAGTTTCCGAGAAACAACGAACAGAACGAAACTGTGGGATCCACTCAGTTAACTGTGGAGCAAGAGGCGCGTGCGCGTAGTGCACCCCCTCCGTGTCATTCCACCCCCACCTCCCCCGTACCGGTCATAACACCCCCCATATCACCGAAACAGGACACAAACATCATCATCGAACAGCCAGAGACAGATTGCACGGATGAACCTCCTCCGGCCGTCGAGTGA

Protein sequence:

>DPOGS213509-PA
MWSHVQAQVSMCLLSSSSISQGGSMSARRGVPAPPRAATAGPGRARAAPARGHLPPRLPCGALQHAARSAGGAGCGRGAKLQLARDAGRRASDRVGHEGERDDAHSDAAGCRQGNAAAGSLAESPGGPAAPAPRAPRDTSLHPPPSAQNDDRPADLSDAQSSTSICAEQVQRLHGAVRFCLLNRSGWSKEAGCVRTVATQMATRAAVAMRRARSVTGPLRRAHPASGATWNVQVVKGKMSSQCLWHACRALSAGLLLMLLGAAMAVIGYYADTLSVAEEVRGNSTVSVKDEARGFHLNNLSYAGPIVMGFGGFIVVAACVMTFEARDSAAKVTPARPQTIPRPLPRRGPCAPARLDTLGVYRLPHVLPLPHSLPLAPVRVRPHVHRTSGDKARNRARFGSAPDLRSGSGLATPSVTALRRPLRRYALSVDEPPHSAVRTQHHYLHPSTITKPSSHSISSASAVESECGSQSSLALDLHASGACAGVTLRVRDNTRRRPLARQQRLNEDTIHPSGESAGHVANTRQFPRNNEQNETVGSTQLTVEQEARARSAPPPCHSTPTSPVPVITPPISPKQDTNIIIEQPETDCTDEPPPAVE-