Monarch geneset OGS2.0

DPOGS208749
Transcript: DPOGS208749-TA, 1881 bp
Protein: DPOGS208749-PA, 626 aa
Genomic position: DPSCF300043 + 506318-514377
RNAseq coverage: 107x (Rank: top 60%)
Annotation
Heliconius:      HMEL015220       7e-138  63.50%
Bombyx:          (no hit listed)
Drosophila:      CG6574-PA        1e-59   35.08%
EBI UniRef50:    UniRef50_B0WCL0  2e-62   42.73%  Putative uncharacterized protein n=2 Tax=Culicinae RepID=B0WCL0_CULQU
NCBI RefSeq:     XP_001846444.1   4e-63   42.73%  conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp:  gi|170037194     7e-62   42.73%  conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastx:  gi|194746219     8e-69   36.86%  GF16177 [Drosophila ananassae]
Group
Gene Ontology:    GO:0016020  3.1e-112  membrane
                  GO:0006810  3.1e-112  transport
                  GO:0005542  3.1e-112  folic acid binding
                  GO:0008518  3.1e-112  reduced folate carrier activity
KEGG pathway:     (none listed)
InterPro domain:  [1-612]    IPR002666  3.1e-112  Reduced folate carrier
                  [273-621]  IPR016196  4.6e-13   Major facilitator superfamily domain, general substrate transporter
Orthology group:  MCL10612  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208749-TA
ATGCAAGATTGGATAAAGATAACCCTTATACTATGTTCCTTTGGAATGTTGAGAGAGATTCGGCCTTCTGAGCCGTTCGTGACTGAATTCCTTCTTGGTGAATGGCGGAACATTACAGAAGACCAACTCAATCGTGAAGTTTATCCCATAGGAACCTATTCCTATCTGGCTTTACTTGTTGTTGTCTTTTTAATCACCGACTTTCTCCGTTTTAAACCAGTTATAATACTTTCAGGTGTCAGTGGTATATTGGTATATGCTGTGTTGTTATGGACGTCCAGTATACAATGGCTCCAAGTGTCACAGTTCCTTTATGGCTTATATATGGCCACAGAAGTTGCATATTTGACATACATATATGCTAAGGTAGATTCAGCAAAATACCCAGTGGCTTCCTCGTATACTAGAATAGCGGCCTTATCCGGACGTTTCCTATCAGGTGTAAGCTCACAACTATTGACACATTTCGAGCTTATGGACTACAGACAGCTCAACTACATCACACTAATCGCGTATACGCGACCCAAAGTCCTCGTCTGGTCCGCCTTGTATGCAGCCACCCTAGCGTTGTTTGTACAAACTCAAACATATATACAATTATTGTGGAAGCAAATACAAAAAGGAACTGACAGTCCTGTGGTGTACAATGGTGCAGTGGAAGCCACTCAGACCCTGCTAGGTGCGGTGGGTTCCTTCGCATCCTCCCATCTGACTCGCGCCTTATATCCAGGCCTTGCTGGTGTTGCTCAGGGCACAGCTGTATTCCTAGGCGCGTTCATAGATAACGTGTTCGTGTCTTACGCCGGTGTCAGTGGTATATTGGTATATGCTGTGTTGTTATGGACGTCCAGTATACAATGGCTCCAAGTGTCACAGTTCCTTTACGGCTTGTATATGGCCACAGAAGTTGCATATTTGACATACATATATGCTAAGGTAGATTCAGCAAAATACCCAGTGGCTTCCTCGTATACTAGAATAGCGGCCTTATCCGGACGTTTCCTATCAGGTGTAAGCTCACAACTATTGACACATTTCGAGCTGATGGACTACAGACAGCTCAACTACATCACACTAATCGCACAAATCTTGGCAACATTTTGGGCTTTCTGGTTGCCCCCCGTGCCATATGGTATTTATTTCCATAGACAATCTATAGACAATACGTTTCAGGTGGACTATACAAAAACCGACGATAAATCACATTCCAGCCCAACAGACACTGTAAAGGATACCTTCATAAGGAACGTAAAAAACGCCGCGTTTCTTATATATAAACACGCCCGTTTAGCGTATACGCGACCCAAAGTCCTCGTCTGGTCCGCCTTGTATGCAGCCACCCTAGCGTTGTTTGTACAAACTCAAACATATATACAATTATTGTGGAAGCAAATACAAAAAGGAACTGACAGTCCTGTGGTGTACAATGGTGCAGTGGAAGCCACTCAGACCCTGCTAGGTGCGGCGGGTTCCTTCGCATCCTCCCATCTGACTCGCGCCTTATATCCAGGGCTTGCAGGTGTTGCTCAGGGCACAGCTGTATTCCTAGGCGCCTTCATAGATAACGTGTTCGTGTCTTACGCCGGTTATATAGTTATGGGTCTTTTGTACCACTATATTATAACACTAGCGAGTGCCAAAATCGCCTGCCAGTTGACCGACGAGAGTTGTTTCGGTCTGATATTCGGTATCAACACCCTGGTCGGTACTGGACTCCAGTCGATACTGACCGTCGTCTTAATACAGAGTCTATCATTAAACATAACCTCACAATACTTCTCTATCAGCGGTCTCTTCGTAATCTTAGCAGCCGTGTGGATATTGGGATTGATGTTGCACGGTTGCAAACAGAAAAGAATTAATGCCGCCGCCCAGTATTAA

Protein sequence:

>DPOGS208749-PA
MQDWIKITLILCSFGMLREIRPSEPFVTEFLLGEWRNITEDQLNREVYPIGTYSYLALLVVVFLITDFLRFKPVIILSGVSGILVYAVLLWTSSIQWLQVSQFLYGLYMATEVAYLTYIYAKVDSAKYPVASSYTRIAALSGRFLSGVSSQLLTHFELMDYRQLNYITLIAYTRPKVLVWSALYAATLALFVQTQTYIQLLWKQIQKGTDSPVVYNGAVEATQTLLGAVGSFASSHLTRALYPGLAGVAQGTAVFLGAFIDNVFVSYAGVSGILVYAVLLWTSSIQWLQVSQFLYGLYMATEVAYLTYIYAKVDSAKYPVASSYTRIAALSGRFLSGVSSQLLTHFELMDYRQLNYITLIAQILATFWAFWLPPVPYGIYFHRQSIDNTFQVDYTKTDDKSHSSPTDTVKDTFIRNVKNAAFLIYKHARLAYTRPKVLVWSALYAATLALFVQTQTYIQLLWKQIQKGTDSPVVYNGAVEATQTLLGAAGSFASSHLTRALYPGLAGVAQGTAVFLGAFIDNVFVSYAGYIVMGLLYHYIITLASAKIACQLTDESCFGLIFGINTLVGTGLQSILTVVLIQSLSLNITSQYFSISGLFVILAAVWILGLMLHGCKQKRINAAAQY-
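
The summary fields of this record (transcript length, protein length, genomic position) can be cross-checked with a little arithmetic. The sketch below is illustrative only: `parse_position` is a hypothetical helper, not part of any database API, and it assumes that the coordinates are 1-based and inclusive and that the transcript consists only of coding sequence, from the ATG start codon through the TAA stop codon.

```python
import re

# Values copied from the record's summary fields.
transcript_bp = 1881
protein_aa = 626
genomic_position = "DPSCF300043 + 506318-514377"


def parse_position(text):
    """Split 'scaffold strand start-end' into its parts (hypothetical helper)."""
    match = re.fullmatch(r"(\S+)\s+([+-])\s+(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized position: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


scaffold, strand, start, end = parse_position(genomic_position)

# Length of the genomic interval, assuming inclusive 1-based coordinates.
genomic_span = end - start + 1

# A complete coding sequence has one codon per residue plus one stop codon.
codons = transcript_bp // 3
cds_consistent = transcript_bp % 3 == 0 and codons - 1 == protein_aa

print(scaffold, strand, genomic_span, cds_consistent)
```

Under these assumptions, the 1881 bp transcript corresponds to 627 codons, which matches the 626 aa protein plus a stop codon.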