Claims (54)
1. A reverse transcription reaction composition comprising: mRNA; a reverse transcriptase; and at least 1000 unique oligonucleotide species, each unique oligonucleotide species comprising a barcode region and a uniform region, the barcode region comprising a molecule barcode comprising at least 7 nucleotides and having a constraint on G content, wherein the uniform region is 3′ of the barcode region, the uniform region comprising a target-specific region 3′ of the barcode region, the target-specific region comprising at least 10 nucleotides complementary to a target nucleic acid, wherein the unique oligonucleotide species comprise different nucleic acid sequences in their barcode regions, wherein the constraint on G content comprises: (a) less than 1% of the unique oligonucleotide species comprising the molecule barcode having a G content of 50% or more; and/or (b) the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of no more than 12.5%, and wherein the barcode region further comprises a sample barcode comprising at least 3 nucleotides.
16. A method of specifically barcoding cDNA from two or more samples, each sample comprising nucleic acids, the method comprising: contacting each sample with a pool comprising at least 100 unique oligonucleotide species, wherein each sample is contacted in spatial isolation from the other samples, each unique oligonucleotide species comprising: a barcode region comprising: a molecule barcode comprising at least 7 nucleotides and having a constraint on G content; and a sample barcode comprising at least 3 nucleotides; and a uniform region 3′ of the barcode region, the uniform region comprising a target-specific region 3′ of the barcode region, the target-specific region comprising at least 10 nucleotides complementary to a target nucleic acid, wherein the unique oligonucleotide species of each pool comprise the same sample barcode, and comprise different molecule barcodes, and wherein the constraint on G content comprises: (a) less than 1% of the unique oligonucleotide species comprising the molecule barcode having a G content of 50% or more; and/or (b) the molecule barcodes of all of the unique oligonucleotide species collectively have a G content of no more than 12.5%; hybridizing target-specific regions of at least some oligonucleotides of the unique oligonucleotide species to at least some of the nucleic acids of the sample, wherein the nucleic acids comprise mRNA; and extending the hybridized oligonucleotides by reverse transcription, thereby producing extended strands comprising an oligonucleotide of the unique oligonucleotide species and a sequence complementary to the target, wherein for each sample, the extended strands comprise the same sample barcode and different molecule barcodes, wherein for different samples, the molecule barcodes are different, and wherein a bias in representation of the unique oligonucleotide species in an amplification product of the extended strands is reduced in comparison to the use of control unique oligonucleotide species comprising control molecule barcodes without the constraint on G content.
22. A kit for amplifying barcoded cDNA comprising an immune cell receptor or immunoglobulin variable region coding sequence, comprising: a composition comprising: at least 1000 unique oligonucleotide species, each unique oligonucleotide species comprising a barcode region and a uniform region, the barcode region comprising a molecule barcode comprising at least 7 nucleotides and having a constraint on G content, wherein the uniform region is 3′ of the barcode region, the uniform region comprising a target-specific region 3′ of the barcode region, the target-specific region comprising at least 10 nucleotides complementary to a target nucleic acid, wherein the unique oligonucleotide species comprise different nucleic acid sequences in their barcode regions, wherein the constraint on G content comprises: (a) less than 1% of the unique oligonucleotide species comprising the molecule barcode having a G content of 50% or more; and/or (b) the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of no more than 12.5%, wherein the barcode region further comprises a sample barcode comprising at least 3 nucleotides, and wherein the uniform region comprises a target-specific region comprising a sequence flanking an immune cell receptor or immunoglobulin variable region coding sequence; and a primer configured to hybridize on an opposite side of the variable region as the target-specific region, and to hybridize to a complementary strand to a strand hybridized by the target-specific region, and is thereby configured to amplify the variable region in conjunction with the target-specific region.
25. A composition comprising: at least 1000 unique oligonucleotide species, each unique oligonucleotide species comprising a barcode region and a uniform region, the barcode region comprising a molecule barcode comprising at least 7 nucleotides and having a constraint on G content, wherein the uniform region is 3′ of the barcode region, the uniform region comprising a target-specific region 3′ of the barcode region, the target-specific region comprising at least 10 nucleotides complementary to a target nucleic acid, wherein the unique oligonucleotide species comprise different nucleic acid sequences in their barcode regions, wherein the constraint on G content comprises: (a) less than 1% of the unique oligonucleotide species comprising the molecule barcode having a G content of 50% or more; and/or (b) the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of no more than 12.5%, and wherein the barcode region further comprises a sample barcode comprising at least 3 nucleotides.
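The G-content constraint recited in the independent claims can be illustrated with a short check. This is a minimal sketch, not the patentee's method: the function names `g_fraction` and `check_g_constraint` are hypothetical, and limb (b) is read here as total G count divided by total nucleotides across all molecule barcodes, which is one plausible interpretation of "collectively have a G content".

```python
from typing import Iterable, Tuple


def g_fraction(barcode: str) -> float:
    """Fraction of positions in one molecule barcode that are G.

    Assumes a non-empty barcode (the claims require at least 7 nucleotides).
    """
    seq = barcode.upper()
    return seq.count("G") / len(seq)


def check_g_constraint(molecule_barcodes: Iterable[str]) -> Tuple[bool, bool]:
    """Return (limb_a_ok, limb_b_ok) for a pool of molecule barcodes.

    limb (a): fewer than 1% of the barcodes have a G content of 50% or more.
    limb (b): pooled G content is no more than 12.5% (assumed reading:
              total G count / total nucleotide count over the whole pool).
    """
    barcodes = [b.upper() for b in molecule_barcodes]
    if not barcodes:
        raise ValueError("need at least one molecule barcode")

    # Limb (a): share of barcodes that are individually G-rich.
    high_g = sum(1 for b in barcodes if g_fraction(b) >= 0.5)
    limb_a_ok = high_g / len(barcodes) < 0.01

    # Limb (b): G content pooled over every barcode in the composition.
    total_g = sum(b.count("G") for b in barcodes)
    total_nt = sum(len(b) for b in barcodes)
    limb_b_ok = total_g / total_nt <= 0.125

    return limb_a_ok, limb_b_ok
```

Because the claims join the two limbs with "and/or", a pool would meet the recited constraint if either returned value is true; the sketch reports them separately so that the dependent claims, which each select one limb, can be checked as well.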
2. The composition of claim 1, wherein the constraint on G content is less than 1% of the unique oligonucleotide species comprising the molecule barcode having a G content of 50% or more.
3. The composition of claim 1, wherein the constraint on G content is the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of no more than 12.5%.
4. The composition of claim 1, wherein the unique oligonucleotide species are disposed in at least two spatially isolated pools, each pool comprising at least 100 unique oligonucleotides of the unique oligonucleotide species, wherein unique oligonucleotides in the same pool comprise the same sample barcode sequence, and wherein different unique oligonucleotides of the same pool comprise different molecule barcode sequences.
5. The composition of claim 4, wherein the unique oligonucleotide species of each pool are immobilized on a substrate, so that the sample barcodes but not the molecule barcodes are the same for the oligonucleotide species immobilized on each substrate.
6. The composition of claim 1, wherein for at least 95% of the unique oligonucleotide species, any G in the molecule barcode is not adjacent to another G.
7. The composition of claim 1, wherein at least 95% of the molecule barcodes of the unique oligonucleotide species comprise the sequence HNHNHNHN, wherein each “H” is any one of A, C, or T, and wherein each “N” is any one of A, G, C, or T.
8. The composition of claim 1, wherein each of the unique oligonucleotide species comprises a spacer 3′ of the barcode region and 5′ of the target specific region, said spacer comprising the sequence HHHHHHHH, wherein each “H” is any one of A, C, or T.
9. The composition of claim 1, wherein each oligonucleotide species has a length of 24-140 nucleotides.
10. The composition of claim 1, wherein the composition comprises at least two oligonucleotides of the same unique oligonucleotide species.
11. The composition of claim 1, wherein the uniform region comprises a target-specific region comprising a sequence flanking an immune cell receptor or immunoglobulin variable region coding sequence.
12. The composition of claim 11, wherein the immune cell receptor variable region coding sequence is selected from the group consisting of: a T cell receptor variable region coding sequence, a B cell receptor variable region coding sequence, and a combination thereof.
13. The composition of claim 1, wherein the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of 2.5%-10%.
14. The composition of claim 1, wherein the molecule barcode is 7-9 nucleotides.
15. The composition of claim 1, comprising at least 6,500 unique oligonucleotide species.
17. The method of claim 16, further comprising ascertaining nucleic acid sequences of the strands comprising the oligonucleotides of the unique oligonucleotide species and the sequence complementary to the target.
18. The method of claim 16, wherein the constraint on G content is the molecule barcodes of the unique oligonucleotide species collectively having a G content of less than 12.5%.
19. The method of claim 16, wherein the molecule barcode is 7-9 nucleotides.
20. The method of claim 16, wherein the pool comprises at least 1000 unique oligonucleotide species.
21. The method of claim 16, wherein the pool comprises at least 6,500 unique oligonucleotide species.
23. The kit of claim 22, wherein the molecule barcode is 7-9 nucleotides.
24. The kit of claim 22, wherein the composition comprises at least 6,500 unique oligonucleotide species.
26. The composition of claim 25, wherein the constraint on G content is less than 1% of the unique oligonucleotide species comprising the molecule barcode having a G content of 50% or more.
27. The composition of claim 25, wherein the constraint on G content is the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of no more than 12.5%.
28. The composition of claim 25, wherein the unique oligonucleotide species are disposed in at least two spatially isolated pools, each pool comprising at least 100 unique oligonucleotides of the unique oligonucleotide species, wherein unique oligonucleotides in the same pool comprise the same sample barcode sequence, and wherein different unique oligonucleotides of the same pool comprise different molecule barcode sequences.
29. The composition of claim 28, wherein the unique oligonucleotide species of each pool are immobilized on a substrate, so that the sample barcodes but not the molecule barcodes are the same for the oligonucleotide species immobilized on each substrate.
30. The composition of claim 25, wherein for at least 95% of the unique oligonucleotide species, any G in the molecule barcode is not adjacent to another G.
31. The composition of claim 25, wherein at least 95% of the molecule barcodes of the unique oligonucleotide species comprise the sequence HNHNHNHN, wherein each “H” is any one of A, C, or T, and wherein each “N” is any one of A, G, C, or T.
32. The composition of claim 25, wherein each of the unique oligonucleotide species comprises a spacer 3′ of the barcode region and 5′ of the target specific region, said spacer comprising the sequence HHHHHHHH, wherein each “H” is any one of A, C, or T.
33. The composition of claim 25, wherein each oligonucleotide species has a length of 24-140 nucleotides.
34. The composition of claim 25, wherein the composition comprises at least two oligonucleotides of the same unique oligonucleotide species.
35. The composition of claim 25, wherein the uniform region comprises a target-specific region comprising a sequence flanking an immune cell receptor or immunoglobulin variable region coding sequence.
36. The composition of claim 35, wherein the immune cell receptor variable region coding sequence is selected from the group consisting of: a T cell receptor variable region coding sequence, a B cell receptor variable region coding sequence, and a combination thereof.
37. The composition of claim 25, wherein the molecule barcodes of all of the unique oligonucleotide species in the composition collectively have a G content of 2.5%-10%.
38. The composition of claim 25, wherein the molecule barcode is 7-9 nucleotides.
39. The composition of claim 25, comprising at least 6,500 unique oligonucleotide species.
40. The method of claim 16, wherein the constraint on G content is less than 1% of the unique oligonucleotide species comprising the molecule barcode having a G content of 50% or more.
41. The method of claim 16, wherein the unique oligonucleotide species of each pool are immobilized on a substrate, so that the sample barcodes but not the molecule barcodes are the same for the oligonucleotide species immobilized on each substrate.
42. The method of claim 16, wherein for at least 95% of the unique oligonucleotide species, any G in the molecule barcode is not adjacent to another G.
43. The method of claim 16, wherein at least 95% of the molecule barcodes of the unique oligonucleotide species comprise the sequence HNHNHNHN, wherein each “H” is any one of A, C, or T, and wherein each “N” is any one of A, G, C, or T.
44. The method of claim 16, wherein each of the unique oligonucleotide species comprises a spacer 3′ of the barcode region and 5′ of the target specific region, said spacer comprising the sequence HHHHHHHH, wherein each “H” is any one of A, C, or T.
45. The method of claim 16, wherein each oligonucleotide species has a length of 24-140 nucleotides.
46. The method of claim 16, wherein the target-specific region comprises a sequence flanking an immune cell receptor or immunoglobulin variable region coding sequence.
47. The method of claim 46, wherein the immune cell receptor variable region coding sequence is selected from the group consisting of: a T cell receptor variable region coding sequence, a B cell receptor variable region coding sequence, and a combination thereof.
48. The kit of claim 22, wherein the constraint on G content is the molecule barcodes of the unique oligonucleotide species collectively having a G content of less than 12.5%.
49. The kit of claim 22, wherein the constraint on G content is less than 1% of the unique oligonucleotide species comprising the molecule barcode having a G content of 50% or more.
50. The kit of claim 22, wherein the unique oligonucleotide species of each pool are immobilized on a substrate, so that the sample barcodes but not the molecule barcodes are the same for the oligonucleotide species immobilized on each substrate.
51. The kit of claim 22, wherein for at least 95% of the unique oligonucleotide species, any G in the molecule barcode is not adjacent to another G.
52. The kit of claim 22, wherein at least 95% of the molecule barcodes of the unique oligonucleotide species comprise the sequence HNHNHNHN, wherein each “H” is any one of A, C, or T, and wherein each “N” is any one of A, G, C, or T.
53. The kit of claim 22, wherein each of the unique oligonucleotide species comprises a spacer 3′ of the barcode region and 5′ of the target specific region, said spacer comprising the sequence HHHHHHHH, wherein each “H” is any one of A, C, or T.
54. The kit of claim 22, wherein each oligonucleotide species has a length of 24-140 nucleotides.
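The sequence-pattern limitations in the dependent claims (the HNHNHNHN motif and the rule that no G is adjacent to another G, each applying to at least 95% of the species) can be checked along the following lines. This is an illustrative sketch only; the helper names `contains_hnhnhnhn`, `has_adjacent_g`, and `share_meeting` are hypothetical and do not come from the patent.

```python
import re
from typing import Callable, Sequence

# H = A, C, or T; N = A, G, C, or T (as defined in the claims).
# "comprise the sequence HNHNHNHN" is read as "contains", so search() is used.
HN_MOTIF = re.compile(r"(?:[ACT][ACGT]){4}")


def contains_hnhnhnhn(barcode: str) -> bool:
    """True if the barcode contains eight consecutive bases matching HNHNHNHN."""
    return HN_MOTIF.search(barcode.upper()) is not None


def has_adjacent_g(barcode: str) -> bool:
    """True if any G in the barcode sits directly next to another G."""
    return "GG" in barcode.upper()


def share_meeting(barcodes: Sequence[str], predicate: Callable[[str], bool]) -> float:
    """Fraction of barcodes for which the predicate holds."""
    if not barcodes:
        raise ValueError("need at least one barcode")
    return sum(1 for b in barcodes if predicate(b)) / len(barcodes)


def meets_95_percent_rules(barcodes: Sequence[str]) -> bool:
    """Check both 95% thresholds recited in the dependent claims."""
    motif_ok = share_meeting(barcodes, contains_hnhnhnhn) >= 0.95
    no_gg_ok = share_meeting(barcodes, lambda b: not has_adjacent_g(b)) >= 0.95
    return motif_ok and no_gg_ok
```

In an 8-nucleotide barcode that matches the motif end to end, G can appear only at the four N positions, which are always separated by an H, so such a barcode cannot contain adjacent Gs.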