Monarch geneset OGS2.0

DPOGS215727
TranscriptDPOGS215727-TA1350 bp
ProteinDPOGS215727-PA449 aa
Genomic positionDPSCF300041 + 336851-383274
RNAseq coverage111x (Rank: top 59%)
Annotation
HeliconiusHMEL0096542e-4591.07% 
BombyxBGIBMGA005819-TA5e-7066.22% 
Drosophilabib-PA4e-8445.79% 
EBI UniRef50UniRef50_D6W7H36e-8840.12%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W7H3_TRICA
NCBI RefSeqXP_968782.11e-8840.12%PREDICTED: similar to aquaporin transporter [Tribolium castaneum]
NCBI nr blastpgi|910926362e-8740.12%PREDICTED: similar to aquaporin transporter [Tribolium castaneum]
NCBI nr blastxgi|1700622764e-9343.36%aquaporin transporter [Culex quinquefasciatus]
Group
Gene OntologyGO:00160201.6e-104membrane
GO:00068101.6e-104transport
GO:00052151.6e-104transporter activity
KEGG pathwaybta:2810081e-29 
 K09866 (AQP4)maps-> Vasopressin-regulated water reabsorption
InterPro domain[1-284] IPR0004251.6e-104Major intrinsic protein
[52-268] IPR0232713.8e-46Aquaporin-like
Orthology groupMCL13582 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215727-TA
ATGGGGGAACAGACATTTAATATTGATGACAACAATCTGGAACATCACATAATAACGTTGTTCGATAAGTTGGAGTGTTTACGTCGGGATGCTAATCAGAGTGACAACATGCTGGCCGGTCGTGTGCCAGCGAGACTTGAAGTGCGGACTCTTTCGTTGTGGAAGGCTGTCGTGGCTGAATGTGCGGCGTCCTTCCTCTATGTGTTCATCGTATGCGGCGCCGCCGGCGGGGCTGGTGTGGGAGCCTCCGCCTCGGCTGTGCTTCTCGCTAACGCACTCGCATCCGGCTGCGCTATAGCCACCTTAACTTTATGTTTCGCTCATATTTCTGGAGCCCATATAAACCCTTGCGTGTCTGTGGCCCTGTGCGTGCGCCGGCGCGTGTCTCCTTTACGAACAGCGCTGTACATAGCGGCGCAGTGCGGAGGCGCTATTGCTGGCGCGGCCGCTATCTACGGAGTGAGTGTGCCTGGTTATGCCGGTGGATTATCAGCGTCGTTGTCGGTGGGTGCGGGCGCGGCTTGGGAGCGTCTTGGCTCCGAGTTACTGCTGTCGGGGCTCGTGGCTGCTGCCTACTTCGCGGCCATGGAGAGGAGATGGACGGGTGCTGCACCGGCGCTTCTCGGAGCGGCCTACTGTGCTGCGAGCTTTGTCTCCATGCCGTCGTTAAATCCAGCTCGTTCCCTGGGGCCGTCGTTCGTGCTGTCTCGTTGGGAGAGTCACTGGGTCTGCTGGGTGGGGGGTCTGGGCGGAGGCATAGCATGCGCTCTCCTCCACGAGTGGGGCACTAGGCGGCCTCGCGGCAACTCCAATTCCCGTGCTTCATCGCCTAGGGATCTGGACGAGTTGGATAAACCCGCGTTCCCGAGTCACCACCACTACCGCCAGCCGACGTACTGCGCCGCCGCCCCCCGGCCCGATACAACGGAGCCCCTGTACTCCGGTACCAAGTCCCTGTACTGTCGCTCGCCGCCGCCCGCCCGACACACGCTCCACAGGTCTCAGTCAGTGTACAGTAAGAGCAGCAGTGCTGGTGGAGCGGCTGTCGGCAGTGTGGGCGTGACCACGGCCGTGGGCGGCGTGGGCGCGGTGGCTGCCACGTCACTGGCCACAGCCCAATCTCTACTGCTGCGTCCTCCGCACACCGTCACCATGAACCAGAATGTACACAACGCACAAAGAGACCCGGCCTATGGGACCACAGCTAACGGAGTCAGACCCGGGCCCGCAGGGTCCGCGGCGCCCGATCGTCGCGAGTCCCTGTACGGAGCCCGCCGCGGGCCGCTGTCGTCGGAGGACAGCGCGTACGGCAGCTTCGCCCGCGGGTACCGCCCGCCCGACCACTACTAG

Protein sequence:

>DPOGS215727-PA
MGEQTFNIDDNNLEHHIITLFDKLECLRRDANQSDNMLAGRVPARLEVRTLSLWKAVVAECAASFLYVFIVCGAAGGAGVGASASAVLLANALASGCAIATLTLCFAHISGAHINPCVSVALCVRRRVSPLRTALYIAAQCGGAIAGAAAIYGVSVPGYAGGLSASLSVGAGAAWERLGSELLLSGLVAAAYFAAMERRWTGAAPALLGAAYCAASFVSMPSLNPARSLGPSFVLSRWESHWVCWVGGLGGGIACALLHEWGTRRPRGNSNSRASSPRDLDELDKPAFPSHHHYRQPTYCAAAPRPDTTEPLYSGTKSLYCRSPPPARHTLHRSQSVYSKSSSAGGAAVGSVGVTTAVGGVGAVAATSLATAQSLLLRPPHTVTMNQNVHNAQRDPAYGTTANGVRPGPAGSAAPDRRESLYGARRGPLSSEDSAYGSFARGYRPPDHY-