Monarch geneset OGS2.0

DPOGS201131
TranscriptDPOGS201131-TA1992 bp
ProteinDPOGS201131-PA663 aa
Genomic positionDPSCF300065 - 641211-644394
RNAseq coverage71x (Rank: top 66%)
Annotation
HeliconiusHMEL0137347e-8139.85% 
BombyxBGIBMGA003935-TA8e-11458.58% 
DrosophilaCG6125-PA8e-7743.94% 
EBI UniRef50UniRef50_UPI00021A78281e-9848.76%UPI00021A7828 related cluster n=2 Tax=unknown RepID=UPI00021A7828
NCBI RefSeqXP_001606214.13e-9044.74%PREDICTED: similar to sulfate transporter [Nasonia vitripennis]
NCBI nr blastpgi|3504068902e-9949.25%PREDICTED: sodium-independent sulfate anion transporter-like [Bombus impatiens]
NCBI nr blastxgi|3504068901e-10649.25%PREDICTED: sodium-independent sulfate anion transporter-like [Bombus impatiens]
Group
Gene OntologyGO:00068101.4e-22transport
GO:00550851.4e-22transmembrane transport
GO:00160211.4e-22integral to membrane
GO:00052151.4e-22transporter activity
KEGG pathway 
InterPro domain[132-321] IPR0115471.4e-22Sulphate transporter
Orthology groupMCL30206 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201131-TA
ATGACGAAACCGGAACAGCTGGAGGAAGAGGAGGAGCTGCAGCTGGACAGGGCGGGCGTGCGGCCAGCCCTGGAGCGCCTGCTGCCGGCGGCGCGCTGGGCCAGACTGTACTCGCGGACGGCGGCCGCGGCGGACCTGGTGGCCGGCCTCACGCTTGGCCTCACACTCGTGCCGCAGTCCATCGCCTACGCCGCACTGGCCAACATGCCCGTCCACTACGGCCTGTACTCCGCCCTTGTCGGGTCGCTGGTGTACTCCGTGCTCGGCACGGTCCGTCAAGTGTCCATCGGCCCGACGTCCTTGACGTGTCTGATGACGTTGTCGGCGACCCGCGGCCTGCCCCCGGACGCGGGCGTGCTGCTCTCCTTCCTGGCCGGGTGCGTGGTGCTCGCCATGGGACTGCTCAGACTCGGATTTCTGGTGGACCTGATCTCTCCAGCGGTGACGAGCGGCTTCACCACCGCCACGGCCATCATCATCGTGTGCTCGCAGCTCAAGGGACTACTGGGACTCCGCTTCACGGCCGAGTCTCCCGCCGAGAACCTGACGCTCATCCTGCAGCAGTGGAGGCTCGTGAGGACCAACGACCTCGCGCTGGCGGTCATCTGTTGCACCGCCCTGCTGTTACTACGAAAGCTCAAGGACCTGCCGGTGAGTCCTAAGAAACCGAAGCTGAAGAAGGCGCTCTGGCTAATTTCGATCTCCCGCAACGCGCTCGTGGTGCTGGCTGCCTCCACGTTCGCTTACTGCAGTTACGACCAGCGGCAGCCTCTGTTCCTACTGTCAGGAAAAGTGGAGCCGGGCCTCCCCAAGCTCGCTCTGCCGCCCTTCAGCACCCGACTCGGAAACGAGACCTTGGGCTTCGTGGAGGTAACGCGCCGCCTCGGACACAACGTGCTGCTGCTGCCCTTCATCATGGTCATGGCTAACATTGCCATCGCCAAGGCCTTCGGTGAGTCTCGGTCCGTCCATCTTCTCCCCCTCCCCCCCTTCCCCGGGTCTTATACTGTGGTGATTGTGTCGACAGCGGAGGGCGGGCGCGTGGACGCCACTCAGGAAATGTTGACGCTGGGCGTGTGTAACATCCTATCATCACTGGTCCGCGGCCTGCCCTCGTGCGGCGCCTTCACCCGCTCGGCCGTCAGCCAGGCGTCGGGCGTGCGGTCGCCGGCGGCCGGGGTGTACTCAGGTGAGCGTGGTTCCTCGGCCGCCCAGTCCGTCCTTCGAGCCGACTCTCTCACGTGTGACCGTTCCAGGAGCCGTGACGCTGCTGGCCCTCGTGTACCTCACGGAGTACTTCTTCTATATACCGAGGGCCTGCCTCTCGTCCGTCCTCATCTGCGCCGTCGTGTTCATGGTGAGTGTCCGCGCCCGCCGCAGGTCGGAGGCGGTCGGTTGGGTCTCATGCGCTGGTCGTCCTTAGATCGACCTCTCCTTCGTGCTCCGCGCGTGGCGCTCGTGTCGCTGGGAGGCAGCGGTGGTGGTGGTGACGTGCGTGTCTTGTGTGGCGGGCGGCGTGGGCGCTGCCGTGGCAGGAGGCGCGCTATGCTCGGTGGCAGGCCTGCTGCGGGCGGCGTGTCCCGGCCCGCTCGTGCGTCGCCGCGGCTCGTCGCTGCTGGTCCGCCCCTTGCTGGTCCGCCCCCGCCGTTCGCTCGTGTTCGTGAACGCGGATCGCGCGGCGAGTGTCGTCCGGTCGGCACTGGCCGCGTCGCCTCTGCTCCGGCGGAAGTCGTTCGACTGCGGCTCCCTCGTGTTACTCGACTACACCGCCCGGAGGCCGCCCGTCCGCAGGTCCTGGAGCGGTTGATCGAGGAGCTGGAGGCGGCCGGGTATGACGTGGTGTTCTACAACGCGAACCCCGAGGTCGAGGCCGACTTGCGGCTTTTGGCGCGTCTGGACCCGAGCTGGCTCCTCGTGACTTCGTCTCCCGCCCCGCCCCGGAGCGCTCCTACAGGCGAGGAGGGGGCGCTGCTCGGTGACGCGGAGGTGTAA

Protein sequence:

>DPOGS201131-PA
MTKPEQLEEEEELQLDRAGVRPALERLLPAARWARLYSRTAAAADLVAGLTLGLTLVPQSIAYAALANMPVHYGLYSALVGSLVYSVLGTVRQVSIGPTSLTCLMTLSATRGLPPDAGVLLSFLAGCVVLAMGLLRLGFLVDLISPAVTSGFTTATAIIIVCSQLKGLLGLRFTAESPAENLTLILQQWRLVRTNDLALAVICCTALLLLRKLKDLPVSPKKPKLKKALWLISISRNALVVLAASTFAYCSYDQRQPLFLLSGKVEPGLPKLALPPFSTRLGNETLGFVEVTRRLGHNVLLLPFIMVMANIAIAKAFGESRSVHLLPLPPFPGSYTVVIVSTAEGGRVDATQEMLTLGVCNILSSLVRGLPSCGAFTRSAVSQASGVRSPAAGVYSGERGSSAAQSVLRADSLTCDRSRSRDAAGPRVPHGVLLLYTEGLPLVRPHLRRRVHGECPRPPQVGGGRLGLMRWSSLDRPLLRAPRVALVSLGGSGGGGDVRVLCGGRRGRCRGRRRAMLGGRPAAGGVSRPARASPRLVAAGPPLAGPPPPFARVRERGSRGECRPVGTGRVASAPAEVVRLRLPRVTRLHRPEAARPQVLERLIEELEAAGYDVVFYNANPEVEADLRLLARLDPSWLLVTSSPAPPRSAPTGEEGALLGDAEV-