Monarch geneset OGS2.0

DPOGS208423
TranscriptDPOGS208423-TA2724 bp
ProteinDPOGS208423-PA907 aa
Genomic positionDPSCF300390 + 87800-91729
RNAseq coverage310x (Rank: top 36%)
Annotation
HeliconiusHMEL0128810.096.50% 
BombyxBGIBMGA009425-TA0.094.31% 
Drosophilabrat-PD4e-17683.00% 
EBI UniRef50UniRef50_UPI00020613D80.063.74%UPI00020613D8 related cluster n=1 Tax=unknown RepID=UPI00020613D8
NCBI RefSeqXP_970374.10.073.74%PREDICTED: similar to AGAP010054-PA [Tribolium castaneum]
NCBI nr blastpgi|910813250.073.74%PREDICTED: similar to AGAP010054-PA [Tribolium castaneum]
NCBI nr blastxgi|910813250.073.95%PREDICTED: similar to AGAP010054-PA [Tribolium castaneum]
Group
Gene OntologyGO:00056223.9e-10intracellular
GO:00055156.2e-10protein binding
GO:00082701.6e-06zinc ion binding
KEGG pathway 
InterPro domain[581-855] IPR0110427.9e-59Six-bladed beta-propeller, TolB-like
[268-394] IPR0036493.9e-10B-box, C-terminal
[606-633] IPR0012586.2e-10NHL repeat
[72-121] IPR0130837.3e-10Zinc finger, RING/FYVE/PHD-type
[221-258] IPR0003151.6e-06Zinc finger, B-box
Orthology groupMCL15914 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208423-TA
ATGTTACGTCACAGTGTATCTCACGAAATACGAACTCGGGATGCTGGTTTCCTGAAGATGGCCTCACGCACCCCGTCCCTTGAGTCGCTGCCTGGTGCCAACTCTATAGGTTCCTTGGAGCGTGGCTCCTTATCGCCCCTGACTTTGAGCGGATCCTCGCCCCCTGCGAGCGATTCCGCCGTATGTGATCTTCGTGAGTTCGACGGCCTCGACACCACCTGCGCTATCTGCCGAGAGACATTCGTCGACCCCAAAGTACTCAATTGCTTCCACACTTTTTGCCGCGGGTGCCTAGAGCGGGAACAAACTCATCCCGAGAAAGTCACTTGCGTCACGTGTCGTGTAGACAGCCAGCTGCCACCAGCGGGTGTGCCTGGCCTGTTAACGAACCTTGTAATCGCCGCCGCAGTCGAGCAGGACGCTGATCTTTTGCCAACTGCACGTCAGACAGCCTCGCCCTCAGCTCGCTGCACCGGCTGCAAATCAAAAGAATCTGACGCTGTTGCGCGCTGCGTAGATTGTGCCAACTTCCTTTGTCCTAATTGCGTCATGGCACACCAATTCATGCATTGTTTCGAAGGCCATAGGGTTCTTGCCTTCACAGACCTGAAAGACGATAAAGGCATGCTAGGCTCACTCACCCCAAGCGGGGACAAGACTGCTTTTTGCCCAAGACATAAAAATGATATACTGAAGTATTTCTGCCGTACTTGCTCAGTGCCAGTGTGCAAAGAATGTACCATTATCGAACATCCAGCAACTTTACATGACTGCGAGCATCTCTCTGACGCTGGACCTAAACAGCTAGAACTTATGCAGCAAGCAGTCAACGAAGCTAAATCTCGTGCTACCGAAATAAGGCACGTTGTAAAAACGGTAGAACACGCAGCGGGAAAATTACAAGTCCAATATCACAAAGCTCAAAATGAAATCAATGACACTTTTCAATTCTATCGCTCTATGCTAGAAGAGCGGAAGCAGGAGTTGCTTAAAGAACTCGAAAGTGTTTTCTCAACTAAACAAATTGCTTTGACTGTTGTCGGACAAAAAGCTCAAGAAACTGTTGAGAAAATTTATCAAACCTGCGATTTCGTGGAGCGATTGACAAAGTGTGCAAACATCGCTGAAATTCTAATGTTTAGAAAACTGTTAGATACTAAATTGCAATCTTTAATGAGCACTAATCCAGAACAGAGCGTGCAGACTGCTTGCGAACTAGAATTTGTATCTAATTATCAAGCGATACAAGTCGGTGTAAGGAACACCTTTGGGTACGTTCGTTCAAGCTCGGAAGCTAATGTTGGTCCCACTAAACAACCTCCTATCGCGCGTCCAACAAATGGATCCCTGTTGAACGGAGGTTCTTCATCAAGCAGCAGCAGTGTAAATGGAAGTTCCGGTAGCCTCAATGGAGGAATTCATTTACCTACAGGATTAAATGGCGTCCTAGATCGCCCTTACACCAACGGACTACTCGGACCGACAAGTCAGTCAACTTCACCGTTTGATTCAAACATTATTTCAAAAAGATTCAACAGTCCTAACACTTTGGGGCCGTTCTCTACTGCCATCGGAGAAATTAATTTGAACGGAATAAATCCATATGAAAAATGGTCTAACGGCGGTTGCGACACATTGTTTCCTCCAACCACTACTGATCCTTACTCATTAACAGCGGCTGCGCACAATGACCCTATTCTTGATTTAACTAATAAGTTGATATCCACTGCCATTTTTCCACCAAAGTCTCAAATTAAACGACAAAAAATGATTTATCACTGTAAATTCGGCGAGTTTGGTGTAATGGAGGGTCAATTTACCGAACCCAGTGGTGTTGCAGTTAACGCTCAAAACGACATAATCGTTGCTGATACAAATAACCATAGAATTCAAATATTTGACAAGGAAGGAAGGTTTAAATTCCAATTCGGTGAATGCGGTAAGCGTGATGGACAACTTCTGTATCCAAATAGAGTGGCAGTAGTGAGAACTTCAGGTGACATAATTGTAACTGAAAGATCACCAACACACCAAATACAGATTTACAACCAGTATGGTCAATTCGTTAGGAAATTTGGAGCCAATATTCTTCAACATCCTCGCGGCGTAACAGTTGACAACAAAGGAAGGATTGTTGTCGTAGAGTGCAAAGTGATGCGCGTCATCATCTTCGACCAAGTTGGCAACGTGTTACAAAAGTTTGGTTGTTCAAAACATCTAGAATTCCCCAACGGTGTTGTAGTCAACGACAAACAGGAAATATTTATTAGTGACAACCGCGCGCATTGCGTCAAGGTCTTTAATTACGAAGGTATTTATCTGCGTCAAATCGGTGGCGAGGGAGTGACAAATTACCCTATCGGCGTTGGCATCAACGCGTCTGGAGAAATACTGATTGCTGATAACCATAACAACTTCAATTTGACGATATTCACCCAGGATGGTCAGCTGGTGTCCGCCTTAGAAAGCAAAGTTAAACATGCCCAGTGCTTCGACGTCGCTTTAATGGACGACGGATCAGTCGTGCTCGCCAGCAAAGATTACCGGCTTTACATCTACCGCTACGTGCAAGTTCCGCCTATAGGGAAATGTGATGTTAGGGGTTTTATCAATGACACAGTTGACTGCGGCGATCTCTTCACAGCACGTGTTGCACGTGCGACGGTAGGTCCTTCAGGAAATTACGCATTCTTAAAAACCTACCGTTCACTCGTGAATAAATAG

Protein sequence:

>DPOGS208423-PA
MLRHSVSHEIRTRDAGFLKMASRTPSLESLPGANSIGSLERGSLSPLTLSGSSPPASDSAVCDLREFDGLDTTCAICRETFVDPKVLNCFHTFCRGCLEREQTHPEKVTCVTCRVDSQLPPAGVPGLLTNLVIAAAVEQDADLLPTARQTASPSARCTGCKSKESDAVARCVDCANFLCPNCVMAHQFMHCFEGHRVLAFTDLKDDKGMLGSLTPSGDKTAFCPRHKNDILKYFCRTCSVPVCKECTIIEHPATLHDCEHLSDAGPKQLELMQQAVNEAKSRATEIRHVVKTVEHAAGKLQVQYHKAQNEINDTFQFYRSMLEERKQELLKELESVFSTKQIALTVVGQKAQETVEKIYQTCDFVERLTKCANIAEILMFRKLLDTKLQSLMSTNPEQSVQTACELEFVSNYQAIQVGVRNTFGYVRSSSEANVGPTKQPPIARPTNGSLLNGGSSSSSSSVNGSSGSLNGGIHLPTGLNGVLDRPYTNGLLGPTSQSTSPFDSNIISKRFNSPNTLGPFSTAIGEINLNGINPYEKWSNGGCDTLFPPTTTDPYSLTAAAHNDPILDLTNKLISTAIFPPKSQIKRQKMIYHCKFGEFGVMEGQFTEPSGVAVNAQNDIIVADTNNHRIQIFDKEGRFKFQFGECGKRDGQLLYPNRVAVVRTSGDIIVTERSPTHQIQIYNQYGQFVRKFGANILQHPRGVTVDNKGRIVVVECKVMRVIIFDQVGNVLQKFGCSKHLEFPNGVVVNDKQEIFISDNRAHCVKVFNYEGIYLRQIGGEGVTNYPIGVGINASGEILIADNHNNFNLTIFTQDGQLVSALESKVKHAQCFDVALMDDGSVVLASKDYRLYIYRYVQVPPIGKCDVRGFINDTVDCGDLFTARVARATVGPSGNYAFLKTYRSLVNK-