Monarch geneset OGS2.0

DPOGS205633
Transcript          DPOGS205633-TA    2427 bp
Protein             DPOGS205633-PA    808 aa
Genomic position    DPSCF300023 - 399552-413123
RNAseq coverage     93x (Rank: top 62%)
Annotation
Heliconius        HMEL007105          0.0       60.28%
Bombyx            BGIBMGA001143-TA    2e-111    59.24%
Drosophila        CG11665-PA          2e-09     28.95%
EBI UniRef50      UniRef50_D1ZZX7     3e-14     23.99%    Putative uncharacterized protein GLEAN_07380 n=2 Tax=Tribolium castaneum RepID=D1ZZX7_TRICA
NCBI RefSeq       XP_976475.1         1e-13     24.08%    PREDICTED: hypothetical protein [Tribolium castaneum]
NCBI nr blastp    gi|270005331        1e-13     23.99%    hypothetical protein TcasGA2_TC007380 [Tribolium castaneum]
NCBI nr blastx    gi|270005331        7e-23     23.53%    hypothetical protein TcasGA2_TC007380 [Tribolium castaneum]
Group
Gene Ontology     GO:0055085    2e-08      transmembrane transport
                  GO:0016021    2e-08      integral to membrane
                  GO:0008061    1.2e-07    chitin binding
                  GO:0006030    1.2e-07    chitin metabolic process
                  GO:0005576    1.2e-07    extracellular region
KEGG pathway 
InterPro domain   [1-597]      IPR016196    1e-30      Major facilitator superfamily domain, general substrate transporter
                  [56-543]     IPR011701    2e-08      Major facilitator superfamily
                  [741-806]    IPR002557    1.2e-07    Chitin binding domain
Orthology group   MCL26612 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205633-TA
ATGGAAGGTGACGAACAATCAAAGGGCTTTTCATGGCGCTGGCCGCTGCTCATATCCGCTATATTTTTAAATATATGCATTCCAAATCTAATTTTATCGTATGGCGTCCTATGGGTTCAGTTGGCGAGTAATGAAGTGCCGCTATGGGTAGGCCTGGCTGCGCCGTCTTCATATCTATTTGTGTATGGTTTGACACAATGTTGGTTCAGAGAAGCTGCGGATTCTTGGGGTGGACCAGTGGGGTATCGAGTGATGACTGCCACCGGACTATTCATAATTATCTTAAGTTTGATTATTTGTGCCTTCATTCCGCTATATTTTCAACCGGTCGTTTATGGAATACTTGGAGGTTTTGGAGCGTCCTTGATTTCCACTCAAGTTGATGCAGTTTTGTACGAGACTTACGACTCTCGTCTAGGACTCGTAAGGGGCATGTGCTTTACTGGGCAGGCCGTTGGACTGTCACTTTTTCCTCATATTTTGTTTGTTCTCACAAACATTTACGGCTATGTCTATTCTTATATTGTACTTGGAGGTATCATGCTGCAAACCTTACCCGCTATTTTGTTACTGAAGGTAGATGAAACAAAAAAATACGCTTATTTTTCTAATTATAAAGAATTGTCACAGACTTTAACATGTTATAAAAATGAAGGTATTGAAAATTTGTTCGGTACTGAACTGCAACTGCATAGCTTAACAAAGAAATGCTGGAAGAGTCCATCAGACGATAATTTACACCGGGTAGATGAGTTTGATTGTGATGACGGTAATATTTTGGAAACTATAACTCCGCCGCCCAGTCCAGAAGAGAAAAGAAGAAATGTATTTGGTGTTGAAATCCTTCCTGAAATTCCAGAAGAAAGTGAAACTGACGAAATAAGCATCTCAGAACTTAGTTTAAGTGTAAATAAGAAACGTTTAAGTAACGCTATAAAAAGATTTAGTACTTTGGGAGATAATATAGATGAATGTATAACAAGCCAAGTGAGGAAAGATTCCTTAGGAAATATTGAAGTAAATGAAAACACTGAAATAGAAGTAACTTACGACACTATAGAACCTATAACAGATATTCAAACGGAGAAGCTTTTTAATTCATTTGGATTTAGAAGTCGAACGACATATACTAATCTGAAAAGAAAATTTTGGATACCTTCTTACAAATTATACAAAGCTAAGCGAAGATTTATGTACCTGATATGTACTATAAATGATACATTTCTGAAGCCCCTTACTAGATCGTTATCTTGCGGAGCATTTTATCCAGCCCTGTTACTTAATTTCACCAAATTGAGTTTAACTGTTCAAACAGTACTTATGCCAGTGATTGCATCACGAATGCATCCAAATATACCAACACTTGAGGCAAATTTTCTCATTTCATTACAAGGCTTTACTTGGATATGTTTCGCGATATGCACGCCGTGGTTGGTTCAAACAAAGAGGAGTAATTACAAATACATTACAGTTTGTGGTCTAGCCATATCAACGTGTGCTTGTTTTGTGTTCTTGCGAGCTGAAAACCTAGACCTTTTTTCCATTGGTTATGTGGTAGCGGGCTTCGGTTACGGAATTATAACATCAAGTTGGGAGAAAACAGCGCACGAGTTAGTCGGAACCCGGAAATGGGCAAAAATTCATAGTACCTTGGAAACTCTATCTGCGGCTCTGATAGCTATATTTTGTGTAGGACTCTCTTTTGTAATTGGTAGAGACAACGGTCTTCAAATGTCCCTTTTAATTATAGGTGTAACTATGTCCGCTATTTCTTTGATTTGGGCGATCATAGTTATGTTAATTTTAATGCTGCCGTGCGTAGCGTGCTCAAATATTGACTGCAATGGGAAAGCTTTTCACTGCGTGAATTCGACACATTTCAAGATATGCGTTGATTTTGGCGGAGGAATATCGACGACTGTGGACGAATATTTAATTCCGTGTCCACAAGATACAGTGTGTAAATCATATAACCTATATGAATGTGAGTATGAGAAAAC
GACCACTTCTTCTACAGTGACAGTGCAAAGTGAAGTAACGACCGAAGTGATTGAAAGTATTGGTGTGTCCTCTGGGGAATATTCGGATGTAACAACAGTATTTCCTACAAATCAGGAGCAAGATAAAAGAAATACAAAATGTACCCAAGAATCACTTAGCATGTCTAAATCAGAAGAAAATAATTCGAAGAATACAAAAGAAGTATCTGTTGAAAAGTCCAAGGAAGGATTTGCTAAGAAAGATTATTTCGAATGTGAAAGAGAAGGCAAATTTGAAGATCCAGAGAACTGTAGAAAATATTACGTATGTAAGAAAGCTAGAAACTCAACATTTAGGCGTAAGATAAAAGCATGTGACTCCGATGAAGTGTTTCATAAGAAGAAAGGGAAATGTGTAGACGAAGAAAGCTATGAATGTAACATTTAA
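
The transcript above is wrapped across more than one line, so the lines have to be joined before its length can be checked. A minimal sketch, assuming plain FASTA text as input; `read_fasta` is a hypothetical helper name, not part of any tooling associated with this geneset:

```python
def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited token after '>'.
            header = line[1:].split()
            current = header[0] if header else ""
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


# Toy record (not the real transcript), wrapped over two lines.
example = ">toy-TA\nATGGAA\nGGTTAA\n"
seqs = read_fasta(example)
print(len(seqs["toy-TA"]), len(seqs["toy-TA"]) % 3 == 0)
```

For DPOGS205633-TA, the joined sequence should come to the 2427 bp stated in the record, which corresponds to 808 codons plus one stop codon (808 × 3 + 3 = 2427).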

Protein sequence:

>DPOGS205633-PA
MEGDEQSKGFSWRWPLLISAIFLNICIPNLILSYGVLWVQLASNEVPLWVGLAAPSSYLFVYGLTQCWFREAADSWGGPVGYRVMTATGLFIIILSLIICAFIPLYFQPVVYGILGGFGASLISTQVDAVLYETYDSRLGLVRGMCFTGQAVGLSLFPHILFVLTNIYGYVYSYIVLGGIMLQTLPAILLLKVDETKKYAYFSNYKELSQTLTCYKNEGIENLFGTELQLHSLTKKCWKSPSDDNLHRVDEFDCDDGNILETITPPPSPEEKRRNVFGVEILPEIPEESETDEISISELSLSVNKKRLSNAIKRFSTLGDNIDECITSQVRKDSLGNIEVNENTEIEVTYDTIEPITDIQTEKLFNSFGFRSRTTYTNLKRKFWIPSYKLYKAKRRFMYLICTINDTFLKPLTRSLSCGAFYPALLLNFTKLSLTVQTVLMPVIASRMHPNIPTLEANFLISLQGFTWICFAICTPWLVQTKRSNYKYITVCGLAISTCACFVFLRAENLDLFSIGYVVAGFGYGIITSSWEKTAHELVGTRKWAKIHSTLETLSAALIAIFCVGLSFVIGRDNGLQMSLLIIGVTMSAISLIWAIIVMLILMLPCVACSNIDCNGKAFHCVNSTHFKICVDFGGGISTTVDEYLIPCPQDTVCKSYNLYECEYEKTTTSSTVTVQSEVTTEVIESIGVSSGEYSDVTTVFPTNQEQDKRNTKCTQESLSMSKSEENNSKNTKEVSVEKSKEGFAKKDYFECEREGKFEDPENCRKYYVCKKARNSTFRRKIKACDSDEVFHKKKGKCVDEESYECNI-
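
The protein record can be cross-checked against the transcript by translating it codon by codon. A minimal sketch, assuming the standard genetic code and the record's convention of writing the stop codon as `-`; `translate` and `CODON_TABLE` are hypothetical names introduced here for illustration:

```python
# Standard genetic code, built in TCAG order; stop codons are written as '-'
# to match the trailing '-' in the protein record.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY--CC-W"   # T..
         "LLLLPPPPHHQQRRRR"   # C..
         "IIIMTTTTNNKKSSRR"   # A..
         "VVVVAAAADDEEGGGG")  # G..
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES),
    AMINO,
))


def translate(nucleotides):
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    seq = "".join(nucleotides.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        protein.append(CODON_TABLE.get(seq[i:i + 3], "X"))
    return "".join(protein)


# First nine codons of DPOGS205633-TA, copied from the record.
print(translate("ATGGAAGGTGACGAACAATCAAAGGGC"))  # MEGDEQSKG
```

Running `translate` on the full joined transcript and comparing the result with DPOGS205633-PA would confirm that the two records agree, including the final stop codon.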