Login
Help

TRANSCRIPT CARD

Submit your Data

  1. Transcript 'KH2012:KH.C1.244.v1....'
  2. Transcript 'Cisavi.CG.ENS81.R442...'
  3. Transcript 'KH2012:KH.C1.244.v1....'

Transcript Model

Transcript Id

KH2012:KH.C1.244.v1.A.SL2-1

Possible name(s)

COL4A1; COL4A2; COL4A6

Location

KhC1 [5,607,649 / 5,618,616]

Sequences

Amino acid sequence

Length: 1,767

>KH.C1.244.v1.A.SL2-1
CQIRPVTSKRRDNMAQFVHLPNRYTWLLFVLFVSLSLEINAERSIKKLKGPCGGRDCSKL
GKCGFCVSEKGRRGPPGAQGPTGLQGEIGFPGPEGVMGPKGYGGLPGRPGKAGEKGDRGT
IGVPGYSGINGVPGHPGESGPRGYPGKDGCNGTMGDMGPAGPPGYNGLDGLPGPDGMKGA
KGEPAYVNVEDLTKGARGERGHTGPKGPSGRPGTTGPLGPKGYPGKPGPPGRPGSKGRTG
PQGHKGIGYQGESGEKGEPGPPGLPGSPGFLTGPPDSLFTPIPGPPGNNGSKGMKGNVGM
QGEPGYPGFEGERGRFGEKGEKGVRGLAGGRGPTGAAGVPGYDGRQGERGMPGRSGIDGA
PGMKGEQGEQGPVGAAGLPGLQGLQGPIGSPGSPGNVGKPGTAGPHGQPGINGFDGPPGE
DGLPGQIGLPGNPGVPGVGEKGEPGQKGRNGKHGQDGVPGLPGEEGPRGFPGISGESIPG
LNGRDGPPGLRGEPGQPGARGVAGAPGQIITPDGDHMVGPQGPTGIPGVRGLKGLKGVAG
RDGGPGEKGEKGGECACQDAVGEKGSPGPPGQNGLPGLSGLPGISGQLGEPGEPGESGEN
GPRGFDGQKGRTGEIGPPGQKGEPATLVGDVKGEPGEPGLPGGQGEPGIAGIPGQDGRSG
RRGAHGEKGEVGPQGLPGLPGLKGFQGLKGREGRAGQDAFGLPGQIGFKGEKGDEGFTGP
QGFPGSKGEPGESLGGVAPKGEKGETGSPGRVGFPGLKGTKGEQGKTGVEGVTGDDGERG
DTGEPGLPGVPGEQGLRGPQGESGLPGVPGLSGERGVTGIRGGNGMKGEKGKDGVSYPGP
AGPAGQKGEVGEPGAKGESGSQGFPGLVGLPGPPGLPGLEGTPGLEGLPGKDGSPGEKGE
SALVGRRGPPGPEGPSGVTGPPGKPGIKGNRGPPGLPSSGKLRGPKGSIGFAGRDGETGL
KGDKGSTGLPGETGKPGPAGADGLPGTPGPPGPRGANGRTGPKGSDGIDGLPGLDGMSGL
YGKKGAPGKQGATGPQGFKGEKGSLPPGGLVDVRGQTGEKGDTGPVGEPGQQGLPGPDGP
KGNRGNQGIKGSTGVSGIPGEYGRNGLPGVEGEKGAKGARGRIGLPGVVGRPGPIGETGR
TGLPGPSGFKGQKGMLGEAGQPGLPGREGSPGLHGENGPKGMQGRRGLPGLSGLDGPSGQ
KGERGAIGQSGPKGYPGLVGMKGGRGLPGLDGRDGLNGEPGEDGAPGFDGLDGRPGRRGE
KGKPGVSNVAGPPGATGITGVKGETGLSGLPGESGPIGLKGKRGNPGPAGFSGRPGPVGE
QGLPGFPGPKGEPGLPGGVGIPGRQGLPGKDGQEGFTGHQGLPGIKGMPGLPGQNGLDGV
PGIQGDTGPAGLVGLTGPSGQKGSVGLPGSHGFSGDKGVRGFPGNPGRPGFPGLVGEPGF
KGEPGRSIEPTDLVAGPKGNTGKPGLPGSPGLIGRTGLPGLQGFKGDQGERGLDGRDGIP
GSHGQRGNPGPRGFIGPKGSPGRDGTPGRSGVAGPAGRVRPPGHLIVRHSQTVYIPECPA
GMTKLWEGYSLLYLEGSEKAHGQDLGQAGSCMPRFNTMPFMYCNTQSVCKYGSRNDKSYW
LSTTAAIPMMPVSVDMVPEYISRCSVCESSSIAMAVHSQDMVIPPCPDGWKGIWLGYSFA
MHTAAGAEGGGQSLSSPGSCLQDFRATPFIECNGARGHCFFYNNQYSFWLTTISEENQFG
TPEMETLKAGNLRTRVSRCQVCTRLNQ

Nucleotide sequence

Length: 5,971

>KH2012:KH.C1.244.v1.A.SL2-1
CAGGGTGTAGGATATTATACCACAGACAGGAAGCGGGGAGCAGTTTACATCGGGTCGTCT
GCGGAAGCTTGGCGGCTCTTTGACGGCGAGATCTCCGTAGTGCCAAATTCGGCCAGTTAC
GTCAAAAAGAAGAGACAATATGGCGCAATTTGTTCATTTACCGAATAGATACACATGGCT
GCTGTTCGTGTTGTTCGTTTCACTGAGTCTGGAAATCAACGCAGAGCGATCTATCAAGAA
GCTGAAAGGTCCTTGTGGTGGTCGCGATTGTTCAAAGTTGGGGAAATGCGGATTCTGCGT
TTCAGAGAAAGGCCGCCGGGGACCCCCTGGTGCTCAAGGACCGACCGGTCTGCAAGGAGA
GATTGGTTTCCCGGGTCCTGAAGGAGTCATGGGCCCGAAAGGATATGGTGGTCTTCCTGG
CAGACCTGGGAAAGCTGGAGAGAAAGGAGACAGGGGGACTATTGGCGTGCCTGGATATTC
AGGGATCAACGGAGTTCCTGGACACCCTGGAGAGTCTGGTCCCAGAGGATACCCCGGTAA
AGATGGGTGCAATGGAACCATGGGTGACATGGGACCTGCAGGACCTCCTGGATACAATGG
CTTAGATGGATTACCGGGTCCTGATGGAATGAAAGGTGCAAAGGGAGAACCTGCTTATGT
GAATGTTGAGGACCTTACTAAAGGTGCAAGAGGTGAACGTGGCCACACTGGACCAAAAGG
ACCTTCTGGAAGACCTGGTACCACTGGTCCCTTGGGTCCCAAAGGTTACCCTGGCAAACC
AGGTCCACCTGGAAGGCCAGGAAGTAAAGGTCGAACAGGTCCTCAAGGCCATAAAGGAAT
TGGTTACCAGGGAGAGAGTGGTGAAAAGGGTGAACCTGGCCCTCCAGGATTACCTGGTAG
CCCTGGATTTTTGACTGGCCCTCCTGACAGTTTGTTTACACCAATTCCTGGACCACCAGG
AAATAATGGGTCAAAGGGAATGAAGGGTAATGTCGGCATGCAAGGAGAACCTGGTTACCC
AGGATTTGAAGGAGAAAGAGGAAGATTTGGTGAAAAGGGAGAGAAGGGCGTTCGTGGATT
GGCTGGAGGACGAGGACCCACTGGTGCAGCCGGAGTACCGGGTTATGATGGCAGACAGGG
AGAACGTGGGATGCCTGGAAGAAGTGGAATTGATGGAGCTCCCGGTATGAAAGGGGAGCA
AGGAGAGCAAGGTCCAGTAGGAGCAGCAGGTCTTCCTGGATTACAGGGCCTACAAGGACC
TATTGGTTCTCCCGGGTCACCAGGAAATGTTGGTAAACCAGGTACAGCAGGACCACATGG
TCAGCCAGGAATAAATGGTTTTGATGGACCACCTGGTGAAGATGGTCTACCTGGTCAGAT
AGGTCTGCCTGGTAACCCTGGTGTGCCTGGTGTTGGTGAAAAAGGAGAACCAGGACAGAA
AGGTCGTAATGGTAAACATGGTCAGGATGGTGTCCCAGGTTTACCTGGTGAAGAAGGGCC
AAGAGGATTTCCAGGAATTTCTGGTGAATCCATTCCTGGACTGAATGGAAGAGATGGCCC
GCCCGGATTACGGGGTGAACCTGGACAGCCTGGTGCGAGAGGTGTAGCTGGTGCACCTGG
TCAAATTATCACACCAGATGGAGATCATATGGTTGGTCCACAAGGTCCAACAGGAATACC
CGGTGTACGTGGTTTAAAGGGTCTCAAAGGTGTTGCTGGTAGGGATGGTGGCCCTGGAGA
AAAAGGAGAGAAAGGGGGTGAATGTGCGTGTCAAGATGCAGTAGGGGAAAAAGGAAGTCC
TGGACCTCCTGGCCAAAATGGTCTGCCTGGACTATCAGGGTTACCTGGCATCAGTGGCCA
GCTGGGTGAGCCTGGTGAACCAGGAGAATCTGGTGAAAACGGACCAAGAGGATTTGATGG
CCAAAAAGGAAGAACTGGTGAGATAGGTCCACCAGGTCAAAAAGGTGAACCAGCAACTTT
GGTCGGAGATGTTAAAGGAGAACCAGGTGAACCAGGTTTGCCTGGTGGTCAAGGTGAACC
AGGGATTGCTGGAATTCCTGGTCAAGATGGCAGATCTGGTAGAAGAGGAGCACATGGAGA
AAAGGGTGAAGTTGGACCGCAAGGTTTGCCAGGTCTACCGGGTTTAAAAGGTTTCCAAGG
ACTTAAAGGTAGAGAAGGGAGAGCTGGACAAGATGCGTTTGGTTTGCCTGGTCAAATTGG
CTTCAAAGGAGAAAAAGGAGATGAAGGATTCACAGGACCACAGGGTTTTCCTGGATCAAA
AGGAGAACCTGGTGAGTCACTAGGAGGAGTTGCTCCTAAAGGAGAAAAAGGAGAGACAGG
ATCACCGGGTAGAGTTGGATTTCCTGGGCTGAAAGGTACAAAAGGAGAACAAGGAAAAAC
TGGGGTTGAAGGTGTCACCGGGGATGATGGAGAGAGAGGTGACACTGGTGAACCTGGATT
ACCTGGAGTGCCAGGAGAACAAGGTCTAAGAGGACCACAAGGCGAATCTGGCTTGCCAGG
GGTACCTGGTTTATCTGGTGAACGTGGTGTCACTGGTATACGAGGAGGCAATGGTATGAA
GGGTGAAAAAGGTAAAGATGGTGTTTCATACCCTGGTCCTGCTGGCCCTGCTGGCCAGAA
AGGTGAAGTAGGAGAACCGGGAGCTAAGGGAGAGTCTGGAAGCCAAGGATTCCCAGGATT
AGTTGGCCTGCCTGGTCCTCCTGGATTACCTGGTTTAGAGGGAACTCCAGGTTTAGAGGG
TTTGCCTGGAAAAGATGGTTCACCAGGTGAAAAGGGTGAATCAGCACTTGTTGGAAGGAG
AGGTCCACCTGGACCAGAAGGACCAAGTGGTGTTACTGGTCCACCAGGTAAACCTGGGAT
TAAAGGTAATAGAGGACCACCAGGTTTACCATCAAGTGGTAAACTAAGGGGACCAAAGGG
ATCAATTGGATTTGCTGGAAGAGATGGTGAAACTGGTCTTAAAGGTGACAAGGGGTCAAC
TGGTTTGCCTGGTGAAACTGGTAAACCCGGTCCTGCTGGAGCAGATGGTCTTCCTGGAAC
TCCCGGTCCACCTGGACCTCGAGGTGCCAATGGTAGAACTGGACCAAAGGGAAGCGATGG
CATAGATGGCTTACCTGGTCTTGATGGTATGTCAGGACTTTATGGCAAAAAGGGAGCTCC
AGGTAAACAGGGAGCGACTGGTCCTCAAGGATTTAAAGGTGAAAAAGGTTCACTACCTCC
GGGTGGTTTGGTCGATGTCAGAGGACAAACTGGTGAGAAGGGTGACACTGGGCCAGTAGG
AGAACCTGGACAGCAAGGTCTTCCTGGACCTGATGGACCCAAAGGAAACAGGGGAAACCA
AGGTATTAAGGGTTCCACTGGTGTTTCTGGAATACCAGGAGAATACGGGCGGAATGGTTT
ACCTGGAGTGGAAGGAGAAAAAGGGGCAAAAGGAGCAAGAGGACGTATTGGACTACCGGG
TGTTGTTGGTAGACCTGGTCCCATTGGCGAAACTGGCAGAACTGGTTTGCCAGGACCTTC
GGGTTTTAAGGGACAGAAAGGTATGTTGGGTGAAGCAGGTCAACCTGGCCTACCTGGTAG
GGAGGGAAGTCCTGGGTTACATGGAGAAAATGGACCCAAGGGAATGCAAGGCAGAAGAGG
TTTGCCTGGACTAAGTGGATTAGATGGACCATCTGGTCAAAAGGGTGAAAGAGGTGCCAT
TGGACAATCAGGACCAAAAGGTTACCCAGGGTTGGTGGGAATGAAAGGTGGTCGTGGTTT
GCCTGGTTTGGATGGACGAGATGGTTTGAATGGTGAACCAGGGGAGGATGGAGCACCTGG
ATTTGATGGTTTGGATGGTCGACCTGGAAGGAGAGGAGAAAAGGGAAAACCAGGTGTATC
TAATGTTGCGGGTCCACCAGGTGCAACTGGAATTACAGGTGTTAAAGGAGAAACTGGTTT
ATCCGGTTTACCTGGTGAATCAGGTCCCATTGGGCTTAAGGGGAAGAGAGGCAACCCAGG
ACCAGCGGGATTTTCTGGAAGACCTGGTCCAGTTGGTGAACAAGGTTTGCCAGGTTTTCC
TGGACCAAAAGGAGAACCAGGTCTACCTGGAGGAGTTGGTATCCCTGGCAGACAAGGTTT
ACCTGGAAAGGATGGACAAGAAGGGTTTACTGGGCATCAAGGTTTGCCTGGAATTAAGGG
AATGCCTGGGTTGCCAGGCCAAAATGGTCTTGATGGTGTTCCTGGTATTCAAGGAGATAC
AGGACCAGCAGGTTTAGTTGGTCTTACTGGACCTTCAGGCCAAAAAGGAAGTGTTGGATT
ACCAGGATCTCATGGGTTTTCTGGTGACAAAGGTGTAAGAGGATTTCCAGGAAACCCAGG
ACGGCCAGGTTTTCCAGGTCTTGTAGGAGAACCTGGGTTCAAAGGTGAACCAGGTAGGTC
TATTGAACCAACAGATCTTGTTGCTGGACCAAAAGGAAACACTGGAAAACCTGGTTTACC
TGGATCACCTGGTTTGATTGGCAGAACTGGTCTTCCAGGACTACAAGGTTTTAAGGGTGA
TCAAGGTGAACGTGGTTTGGATGGACGTGATGGAATACCAGGTTCACATGGACAGAGAGG
AAATCCAGGCCCTCGTGGCTTTATTGGACCTAAAGGAAGCCCAGGACGAGATGGAACTCC
TGGTCGATCTGGTGTAGCTGGTCCCGCTGGACGTGTTCGTCCTCCAGGCCATCTTATTGT
TCGTCACAGTCAAACTGTTTATATACCGGAGTGTCCAGCTGGAATGACTAAACTTTGGGA
AGGTTACAGTTTGCTTTACCTTGAAGGGAGTGAAAAGGCTCACGGTCAAGATTTGGGTCA
AGCGGGGTCTTGCATGCCACGATTCAACACGATGCCATTCATGTACTGTAACACACAAAG
TGTGTGCAAGTATGGAAGTAGGAATGATAAGTCATATTGGTTATCCACCACAGCTGCTAT
TCCTATGATGCCCGTTTCTGTTGATATGGTTCCTGAATACATCAGTAGATGCTCAGTATG
TGAATCATCTTCAATTGCTATGGCAGTCCACAGTCAAGATATGGTAATTCCACCTTGCCC
AGATGGATGGAAAGGAATCTGGCTTGGCTACAGTTTTGCTATGCATACAGCGGCTGGTGC
TGAAGGTGGAGGTCAGTCTCTATCCAGCCCAGGTTCTTGCTTACAAGACTTCCGTGCAAC
ACCTTTCATAGAGTGTAACGGTGCAAGAGGTCACTGCTTCTTTTACAACAACCAGTACAG
TTTCTGGCTTACGACAATTTCTGAGGAAAATCAATTTGGAACACCTGAAATGGAAACGCT
GAAAGCTGGAAACTTACGAACAAGGGTTAGTCGATGCCAAGTGTGCACACGGTTGAATCA
GTAAAGAGCTCCCATTGGGAACAAGGAAACTCTTTGAGCACACTAACTCAGTTCAAAACA
AAACCGAGAACAAATTCTGCTCACACATATGCCTTGTAGCCAACAATCATTGCATTTTTC
CATTAACGCCATTCAGCATATACTTTACATTGAGTTGGTTTAATCGTGTGCGTATGTTGC
CAATTGTCCAATACCACTTTCAGGGGACAATGCTTTTAACTAATATCATTCAAACTGAAT
GAGGCAGTATTCAGCACAATACAGCATACTTACAGTGTTTGTTGTTTCTGTACATAACGT
GATTTTTTACTGTTATATCATTTATACCTTTATAGCGAATCTCTTTCCTTCCCTAAAAAT
ACTGAACCAGCTACAGGTGCCTTGATAGCAAATCCACGTCAGTCAGGAGAATATAATTTA
TTTTAAAGGCAACGTTAATTTTTCTGACCGCCCGTTTTTTCAACCTCCCATGCTATAAAC
GTACAATAGTTTATAACTACACATGCTTACTCAAACCGTTGTTTCATTTTGTGTGCAACT
ATGCCCGAGTTAAATTAAATGATTTACACGA

InterProScan

Pfam
Collagen (IPR008160) - T[71-128] 6.5E-8 - T[125-184] 4.1E-9 - T[195-246] 3.0E-7 - T[284-341] 2.7E-7 - T[335-393] 1.4E-7 - T[380-437] 7.0E-7 - T[633-691] 8.9E-8 - T[741-795] 1.9E-8 - T[779-834] 6.7E-8 - T[838-890] 1.2E-8 - T[878-935] 5.1E-8 - T[943-996] 2.6E-8 - T[1328-1386] 1.0E-7 - T[1388-1446] 1.9E-9 - T[1456-1507] 6.7E-8 - T[1480-1538] 1.3E-8
Gene3D
Collagen_IV_NC_sf (IPR036954) - T[1540-1765] 3.2E-109
ProSiteProfiles
Collagen_IV_NC (IPR001442) - T[1543-1766] 113.154
SMART
Collagen_IV_NC (IPR001442) - T[1543-1650] 9.1E-58 - T[1651-1765] 6.5E-66
Pfam
Collagen_IV_NC (IPR001442) - T[1545-1648] 1.4E-35 - T[1653-1763] 2.6E-41
SUPERFAMILY
CTDL_fold (IPR016187) - T[1545-1651] 4.94E-42 - T[1652-1763] 4.24E-43

Best Blast Hits in UniProt
Protein Name Identity Bit Score e-value
CO4A2_HUMAN 47.217 % 614 0
CO4A1_HUMAN 40.147 % 571 7.01E-174
CO4A6_HUMAN 42.834 % 741 0