TRANSCRIPT CARD

Transcript Model

Transcript Id

KH2012:KH.C4.671.v1.A.ND1-1

Possible name(s)

SALL1; SALL3; SALL4

Location

KhC4 [5,239,794 / 5,246,210]

Sequences

Amino acid sequence

Length: 940

>KH.C4.671.v1.A.ND1-1
SGMPGVNGFQLQNMQQLLAAAASMSGTHTPGNNLLPSVAAAIGRHFNANNTSVEKLSETV
SQLPKAKDLEMEMNEALQQESRTKNGEEENAFYGGRIGNPQEPPLYRNCSGMNSDPNQPG
GGYGTAEESLSRRSCRFCQKVFGSESALQIHLRSHTGERPFKCNICANRFSTKGNLKVHF
SRHQEKYPHIEMNANPIPEYLDNVPTSSGIPYGMSIIPDFTEVPDPDTALPVPVPDKHIP
PPLIHNSRILHQSSSPPSFALPRHMSEGSRPTYDEEPSKRQNAPSETSKLQQLVDQIDKG
KELEKNECHICHRVLSCQSALKLHYRTHTGERPYKCDLCSRAFTTRGNLRTHYSSVHRQQ
LRSSPPTNPSVMRGVSLQCPLCGSRFMDQQSMRQHMQMHLYMHSQQQQQVAHFLHGRHSE
GQIPLAFGGKFPPSIGENDIRMPDQITEIENQPLNRGADTSEPSDDVFEERSPDRETFSE
PDDRVVPVGSPHLRHESEERVNREQEPSSLSPPDGKGSERLPLPGTSDSIVTKPLTSGIS
ALDLTRSNSNSPGVITSSMYSNAGMMGHDNHRPSSQSPPQLASNSLNALAHLASNSAIMR
PSGMPLMNVRNPLEGEMRARKMTTSCEICNKPFTCQSALEIHLRIHTKERPYLCRVCERG
FTTKGNLKQHLLTHNINEVDDDLLEPVETSPITANSNSNSPVNSPATIVNSNQQALQRKR
PSESSDGQSTAKRTYPRHWCHICQKQFSSASSLQIHNRTHTGEKPFACSVCGRAFTTKGN
LKVHMGTHVWGAGGSRRGRRISMDNPLISPWMQNTSNSSSNPPSQAIRPRPAAPPIPAVP
DPALIYQQYAALASGLIGAKASAESRFHANGMLNLHNAAAARLLLPHPNGHVPPSSVGAQ
MGHHVPTAGEHVKGNERSNNIAAASEWIWKAYQRTQEQVN

Nucleotide sequence

Length: 3,314

>KH2012:KH.C4.671.v1.A.ND1-1
CGTCTGGCATGCCAGGAGTTAACGGTTTTCAGCTGCAAAATATGCAGCAGTTGTTAGCCG
CTGCCGCCTCAATGTCAGGGACTCATACGCCGGGAAACAATTTACTGCCGTCAGTTGCTG
CGGCCATTGGCCGTCATTTCAATGCAAATAATACTTCGGTGGAAAAATTAAGCGAAACAG
TTTCTCAGCTGCCCAAAGCTAAAGATCTTGAAATGGAAATGAACGAAGCTTTGCAGCAAG
AGAGTAGAACGAAGAATGGAGAGGAGGAAAACGCGTTTTATGGAGGAAGGATTGGAAACC
CTCAAGAGCCACCCCTTTACCGGAACTGCTCCGGGATGAACAGCGATCCCAATCAACCTG
GAGGGGGTTATGGGACTGCTGAAGAATCGCTTAGCAGGAGGTCGTGTCGCTTCTGCCAGA
AGGTTTTCGGTAGCGAGAGTGCGCTGCAGATTCATCTGCGTTCACACACAGGCGAACGGC
CGTTCAAATGCAACATCTGCGCAAATCGCTTCTCAACAAAGGGGAATCTAAAAGTCCATT
TTTCTCGGCACCAAGAAAAATATCCTCACATTGAAATGAATGCCAACCCCATACCTGAAT
ACTTAGACAACGTCCCTACGAGCTCTGGCATTCCGTACGGCATGTCAATTATCCCGGATT
TCACAGAGGTTCCAGACCCCGACACCGCTCTTCCAGTGCCAGTACCTGACAAACACATCC
CCCCACCTTTAATACACAATTCAAGAATACTACACCAGTCATCGTCACCGCCAAGTTTTG
CTCTTCCCCGCCACATGAGCGAGGGATCCCGGCCAACCTATGATGAAGAGCCGAGCAAAA
GGCAAAATGCTCCTTCTGAGACATCGAAACTACAACAACTGGTTGACCAAATCGACAAGG
GGAAGGAACTGGAAAAGAACGAGTGCCATATTTGCCATCGAGTTCTAAGCTGCCAAAGCG
CTCTCAAACTGCACTACAGAACGCACACAGGCGAAAGACCATACAAGTGCGACCTATGCT
CACGGGCTTTCACTACTCGAGGAAACTTGCGTACCCATTACAGCAGCGTTCATAGACAAC
AACTACGTTCATCTCCTCCCACCAACCCCTCAGTTATGCGTGGTGTGTCACTACAATGCC
CATTGTGCGGGAGCCGGTTTATGGACCAGCAGTCAATGCGACAACACATGCAAATGCATC
TGTACATGCACAGTCAGCAACAACAACAAGTTGCTCATTTCCTACACGGTCGCCACAGCG
AGGGACAAATACCTTTAGCGTTTGGAGGCAAATTCCCTCCCAGTATTGGTGAAAACGACA
TCCGCATGCCGGACCAAATAACTGAGATCGAGAATCAGCCTTTGAACCGAGGCGCCGATA
CATCAGAGCCAAGCGACGACGTTTTCGAAGAAAGGTCTCCGGATCGCGAAACCTTTTCGG
AACCCGACGACAGAGTAGTTCCCGTTGGTTCTCCGCATTTGCGGCATGAATCTGAAGAAC
GTGTAAACCGCGAGCAAGAACCTAGCTCTCTATCCCCACCTGACGGTAAAGGAAGTGAGC
GTCTACCGCTGCCTGGCACAAGCGACAGCATCGTGACCAAACCGCTTACCTCAGGGATAT
CAGCTCTCGACCTGACGCGCTCCAACTCCAACTCACCCGGTGTAATCACCTCATCAATGT
ACTCTAATGCGGGAATGATGGGACACGACAACCATCGCCCATCTTCTCAATCGCCACCTC
AACTTGCATCCAACTCACTGAACGCTCTTGCGCACCTCGCGTCGAACAGCGCCATAATGC
GGCCAAGCGGAATGCCACTAATGAACGTGCGGAACCCGCTAGAGGGCGAAATGCGGGCAA
GAAAGATGACGACATCGTGTGAGATTTGCAACAAACCATTCACCTGCCAAAGCGCGTTAG
AGATACATTTACGAATTCATACAAAGGAAAGACCATACTTGTGCCGAGTTTGCGAGCGAG
GGTTTACCACCAAAGGTAACCTAAAGCAACATTTACTAACACACAATATCAACGAGGTGG
ACGACGACCTACTGGAGCCGGTTGAAACCTCTCCTATAACTGCTAACTCGAATTCAAACA
GTCCTGTAAACTCGCCCGCTACCATCGTTAATTCAAACCAACAAGCATTACAAAGAAAGC
GTCCAAGCGAAAGTAGCGACGGCCAATCAACAGCAAAGCGGACCTATCCTAGACATTGGT
GCCATATATGCCAGAAACAGTTTTCATCGGCTAGTTCACTACAGATACACAACAGAACAC
ATACAGGCGAAAAGCCGTTTGCTTGCAGCGTATGTGGACGAGCATTTACAACCAAAGGAA
ACCTAAAGGTCCATATGGGCACGCACGTCTGGGGAGCAGGGGGTTCACGGCGCGGTAGAC
GTATCTCGATGGACAACCCATTAATTTCACCCTGGATGCAAAATACTTCGAACTCTAGCT
CTAACCCACCAAGTCAAGCTATAAGACCGAGACCAGCAGCTCCCCCAATACCAGCTGTGC
CAGACCCTGCCCTAATCTACCAGCAATACGCGGCGCTTGCTTCAGGATTGATCGGGGCGA
AGGCATCAGCAGAGTCGAGGTTTCATGCGAACGGGATGCTTAATCTTCACAATGCAGCTG
CAGCTAGATTACTTTTGCCGCATCCGAATGGTCATGTTCCGCCATCTTCTGTTGGTGCAC
AAATGGGACACCATGTACCAACGGCAGGGGAGCATGTGAAGGGGAATGAGAGATCTAATA
ATATTGCTGCTGCGTCAGAATGGATTTGGAAAGCTTACCAAAGAACACAAGAACAGGTGA
ATTAAAAGTACAAGAACCAAACCTAACCACGAAACAAGATATGACTTCGACATCAACGTA
TTATTTCTTCACTCTTATATATACCACCTGAAAGAATTCTTCAAATACGAGAAAAACGTT
ATCTTATGTACTCAAAATACCTTGCTTGTCGTCTAGTATATTGTATAGATAGATTTTAGT
CCTTTTTTGTGTACGTAAGCTGCGAGTAGTGATGCAGTAATATTATACTTGCGCTAGACA
GAGTAGTTTGAAGCTTTACCGAGAGTTGTACTTGCAATAACCTAACTTTTTTAATTGCTG
CTATACCTACTGAGAGGCAGCCTCGATAAAACACTGTAGCAAGTTCATGTTATTGTTGTT
GTGTGGGCGCTGAACGAAACTTGTCGTGCCGTGGAGTTTTGTGCGTTGTATATATTCTAG
TCAAATGTGATAAGCCAATGAATGTCATTTAACTGTCATTGTCGACACTGCGGTTATAAT
AAACTAAGACAAAA
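
Comparing the two sequences above, the amino acid sequence appears to start at the third nucleotide of the transcript (an offset of 2). The following is a minimal sketch that checks this on the first 30 nucleotides, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` and the `offset` parameter are illustrative and not part of the database.

```python
# Standard genetic code (NCBI translation table 1); assumed, not stated in the record.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides, offset=0):
    """Translate complete codons starting at `offset`, stopping at the first stop codon."""
    protein = []
    for pos in range(offset, len(nucleotides) - 2, 3):
        amino_acid = CODON_TABLE[nucleotides[pos:pos + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the transcript shown above.
prefix = "CGTCTGGCATGCCAGGAGTTAACGGTTTTC"
print(translate(prefix, offset=2))
```

Under these assumptions the prefix translates to SGMPGVNGF, which matches the first nine residues of the amino acid sequence. Running the same function over the full nucleotide sequence would be the natural way to check the stated protein length of 940.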

InterProScan

ProSiteProfiles
Znf_C2H2_type (IPR013087) - T[133-160] 14.212 - T[161-188] 11.365 - T[306-333] 13.152 - T[334-362] 11.635 - T[377-408] 10.097 - T[624-651] 14.004 - T[652-674] 10.554 - T[738-765] 14.96 - T[766-788] 11.801
SMART
Znf_C2H2_type (IPR013087) - T[133-155] 0.0049 - T[161-183] 0.0097 - T[306-328] 5.4 - T[334-357] 3.4E-4 - T[377-399] 3.6E-4 - T[624-646] 0.035 - T[652-674] 0.003 - T[738-760] 0.69 - T[766-788] 1.6E-4
SUPERFAMILY
Znf_C2H2_sf (IPR036236) - T[134-183] 1.46E-13 - T[307-353] 9.19E-13 - T[625-674] 1.61E-14 - T[738-788] 2.14E-16
ProSitePatterns
Znf_C2H2_type (IPR013087) - T[135-155] . - T[163-183] . - T[308-328] . - T[336-357] . - T[379-399] . - T[626-646] . - T[654-674] . - T[740-760] . - T[768-788] .
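
Each match in the lines above is written as `T[start-end]` followed by a value (a score or an e-value, depending on the analysis, or `.` when none is reported). As a rough sketch of how these entries could be pulled apart, assuming that layout holds for every line, the hypothetical helper `parse_matches` below returns the coordinates as integers and leaves the value as a string.

```python
import re

# Matches entries such as "T[134-183] 1.46E-13" or "T[135-155] ."
MATCH_RE = re.compile(r"T\[(\d+)-(\d+)\]\s+(\S+)")


def parse_matches(line):
    """Return (start, end, value) tuples; value stays a string because some entries are '.'."""
    return [(int(start), int(end), value) for start, end, value in MATCH_RE.findall(line)]


# SUPERFAMILY line copied from the record above.
superfamily = (
    "Znf_C2H2_sf (IPR036236) - T[134-183] 1.46E-13 - T[307-353] 9.19E-13 "
    "- T[625-674] 1.61E-14 - T[738-788] 2.14E-16"
)

for start, end, value in parse_matches(superfamily):
    # Print residue range, inclusive span length, and the reported value.
    print(start, end, end - start + 1, value)
```

Keeping the value as a string avoids guessing how to interpret `.` or whether a given analysis reports a score or an e-value.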

Best Blast Hits in UniProt
Protein Name   Identity   Bit Score   e-value
SALL1_HUMAN    30.912 %   232         6.48E-63
SALL3_HUMAN    47.17 %    194         1.44E-50
SALL4_HUMAN    40.173 %   224         1.14E-60