PULSTAR DIFFERENTIAL COMPLIANCE SPINAL INSTRUMENT: A RANDOMIZED INTEREXAMINER AND INTRAEXAMINER RELIABILITY STUDY

PulStar Differential Compliance Spinal Instrument:
A Randomized Interexaminer and Intraexaminer Reliability Study
This section was compiled by Frank M. Painter, D.C.
Send all comments or additions to: Frankp@chiro.org

FROM: J Manipulative Physiol Ther 2003 (Oct); 26 (8): 493–501 ~ FULL TEXT

Robert A Leach, DC, Patrick L Parker, Paul S Veal, DC
OBJECTIVE: To provide an entry-level, new technology reliability assessment of the PulStar computer-assisted, differential compliance spinal instrument.

SUBJECTS: Eighteen college students (9 male and 9 female) were recruited by announcements and personal contacts.

METHODS: Following approval of the consent process by the Institutional Review Board of Mississippi State University, a PulStar Function Recording and Analysis System (PulStarFRAS) device was evaluated for clinical reliability. Two examiners, blinded from data collection, used the instrument on individual subjects in random order (lying prone with their backs exposed) to administer light impulses (approximately equal to .9 J which produced a 3- to 4-lb force) at each segmental level throughout the cervical, dorsal, and lumbar spine using probe tips spaced 3 cm apart, straddling the spinous processes, while a computer recorded the findings (resistance on a scale of 0 to 25.5 lb force). Data were analyzed by Exploratory Data Analysis (EDA) with analysis of variance (ANOVA) testing and by use of the intraclass correlation coefficient (ICC). In addition, a mean test (ANOVA) was conducted to determine if a trend in variation occurred as a result of repeated light thrusts to the spine, independent of variance explained by different examiners.

RESULTS: Using EDA analysis and ANOVA, intraexaminer reliability for the 2 practitioners was very high but not perfect. This was confirmed by ICC statistics demonstrating good to excellent reliability for both practitioners (0.89 for the experienced practitioner, 0.78 for the newly trained practitioner). Interexaminer reliability of PulStar was similarly very high but not perfect based on EDA/ANOVA analysis and good to excellent (ICC = 0.87).

CONCLUSION: The PulStar mechanical adjusting device set to analysis mode appears to have good to excellent reliability when used by either an experienced or a novice (but trained) examiner. In addition, as a measure for resistance to a light thrust or spinal compliance, reliability was similarly good to excellent between the 2 doctors using the PulStar instrument.

From the Full-Text Article:

Introduction

Throughout chiropractic history, spinal lesions— termed subluxation or, more recently, subluxation complex—have typically been described as “tight” or “taut fibers,” tender on palpation, that influence the nervous system and impair health; restricted joint motion has been a clinically relevant associated phenomenon as well. [1] According to Sandoz, [2] the first chiropractic text published in 1906 makes note that, “A simple subluxated vertebra differs from a normal vertebra only in its field of motion. . . .” Despite at least 2 decades of initial chiropractic research into proposed mediating variables of the purported chiropractic vertebral subluxation complex (VSC), including measures of motion and fixation, current methods for assessing common spinal problems are generally considered to have only poor to marginal reliability. [3] More recent evidence indicates that at least 1 test of passive intersegmental mobility may have acceptable clinical reliability when conducted on symptomatic subjects but not when used on asymptomatic subjects. [4] For example, physical therapists had acceptable / (weighted [w]) values (ie, acceptable reliability was a [w] value .40; poor reliability was a [w] value < .40) in 10 of 11 symptomatic subjects but had only poor reliability between therapists in 34 of 35 asymptomatic individuals when assessing intersegmental motion range between C2 and T5. [4]

Fischer [5] was the first to report a new device used to measure soft tissue compliance, the Tissue Compliance Meter (TCM) (Pain, Diagnostics and Thermography, Great Neck, NY), which uses a 0.5-cm cylindrical rubber tipped probe with a collar that remains at the surface of the skin even as the probe is pushed into the paraspinal tissues. A force gauge attached to the opposite end of the probe measures the subject’s tolerance to pressure deformation. When used on paraspinal surfaces, it is thought to measure the displacement that occurs when force is applied to the skin overlying the muscles and unknown variables, which might include muscle tone, edema, and skin elasticity, parameters that might theoretically be associated with the purported VSC. [1, 6] Jansen et al [7] were the first chiropractic investigators to report on the device. Initial investigations on 20 asymptomatic male and female subjects revealed that 26% of the paraspinal sites tested significantly different on retest 10 minutes later, although better correlations were found using 2 kg of force (r = 0.70 to 0.92 at L3) than 4 kg of force (r = 0.12 to 0.52 at L3). [7] Others have questioned the reliability of the TCM as well, and even with a much smaller diameter disk, Kawchuk and Herzog, [6] after 10 trials performed in random order by 5 examiners, found only poor reliability using the TCM on 3 substrates and 1 control surface. In contrast with these studies, Sanders and Lawson [8] found adequate reliability and stability between 10-minute intervals in 40 asymptomatic subjects, with only 5% of normal subjects demonstrating significant left/right paraspinal differences. In addition, Waldorf et al [9] demonstrated excellent interexaminer concordance (r values 0.90) on 50 male and 50 female asymptomatic subjects while lying in the prone position and found test-retest temporal variability after 15 minutes and 2-week intervals was low. Finally, Nansel et al [10] used the TCM to determine whether lumbar paraspinal muscle tone is altered by cervical spine adjusting; in a blinded randomized trial, they determined that lower cervical diversified adjustments caused significantly more relaxation bilaterally at the L4-5 level than was observed after upper cervical adjustments (P = .01). While providing no direct evidence that the TCM measures the elusive VSC, these researchers provided evidence that measures of paraspinal compliance may be affected by chiropractic adjustments.

Recently Evans, [11] Evans and Collins, [12] and Leach [13] reported on a novel device used to deliver a light thrust into the paraspinal tissues and measure the resistance, or spinal compliance, in an effort to identify areas of the spine where “fixation” might indicate VSC and effect improved clinical outcomes. Because the instrument — interfaced with a computer — measures and compares resistance from 1 spinal segment to the next, Evans [11] terms this differential compliance. Unlike the prior measures of tissue compliance using the TCM, which relied on subjects verbally identifying when tolerance to pressure had been attained, PulStar measures and compares resistance to a light thrust between consecutive spinal segments. Evans et al [14] proposed that areas of the spine that exhibit increased resistance to a light pulsed thrust might indicate the presence of inflammation, muscle spasm, and/or joint fixation and serve as a mediating variable for VSC. Moreover, they speculate that since the PulStar uses a higher velocity low-amplitude thrust — unlike the TCM which is used to slowly apply greater pressure to examine paraspinal muscles by regions — it may identify intersegmental fixations by identifying significant variance in resistance between consecutive motion segments. However, before subluxation detection strategies and measures of spinal dysfunction can be evaluated for clinical validity and utility, researchers must first develop and identify the most clinically reliable methods for evaluation of spinal function. [3, 15] Computer-aided instruments have the potential to more reliably identify spinal lesions that chiropractors treat, and transducers might arguably be expected to more accurately assess spinal motion restriction and compliance than would manual procedures done by hand and using instruments such as the TCM.

With this in mind, a pilot intraexaminer reliability investigation was completed in April 2001 on a convenience sample of 15 subjects to determine effect size necessary to perform the present randomized intraexaminer and interexaminer reliability study. As expected based on preliminary observations, intraexaminer reliability was best at the C2 spinal level and worst at the C6 level (perhaps owing to easier palpation of the former segment). This preliminary work indicated that at least minimal examiner training was important and that the best protocol included having subjects lying prone with the head slightly flexed (perhaps 25°). It was found that the instrument’s probes must be held flat prior to discharge, that a gentle downward pressure was necessary to permit the instrument to make a more accurate and repeatable preload measurement and discharge, and that retest repeatability was best when the shaft of the instrument was held at 90° perpendicular to the skin’s surface, with the probes equidistant from the spinous process of the vertebral motion segment being tested. Finally, our unpublished pilot study findings seemed to confirm the findings of other PulStar users, who observed that better relaxation and less variability are seen when the clinician first demonstrates the instrument on the subject’s hand, prior to making measurements on the spine with the patient lying prone. The finding that subjects should lie prone for improved interexaminer and intraexaminer reliability was also suggested by another pilot investigation which is being submitted elsewhere. [16] Based on findings of the 2 pilot studies, it was determined that 20 normal subjects (10 male subjects, 10 female subjects) would be suitable to yield satisfactory preliminary reliability data for the PulStar, with a minimum of 2 blinded practitioners each randomly performing 2 examinations on each subject.

The purpose of the present investigation was to extend and expand on the pilot studies and perform a randomized, blinded, new technology assessment of the clinical reliability of a computer-assisted chiropractic analytic device that measures spinal compliance, a measure of resistance to a light pulsed paraspinal thrust. The primary aim then was to determine whether 2 chiropractic doctors could find acceptable agreements, while the PulStar was set to the analysis (ie, measurement) mode. A secondary aim was to provide a preliminary baseline of measures of normal spinal compliance for further research.

Discussion

Of all the differential spinal compliance measures made with the PulStar unit, significant variance between 2 examinations by the same doctor occurred only at the occiput and C3 and only for the novice examiner, while significant variance between examiners occurred only at occiput and C4. The measurement at occiput involved placing the probes over the occiput in such a manner that they straddled the external occipital protuberance. It is quite possible that despite stabilizing the probes by resting them against the side of the first finger of the examiner's free hand, some sliding of the probes during discharge might have created unacceptable reliability at this level. Others using PulStar place the probes directly over the atlanto-occipital joints for the occipital measurement, and this may prove to be a more satisfactory arrangement. Further research will be needed to verify this finding and to determine whether an occipital site is even necessary. Obviously, no other vertebral segment that we tested posed the unique anatomy found at the occiput.

We were only mildly surprised to find poorer agreement at the C3 and C4 levels, after the results of our earlier pilot investigation had revealed poorest reproducibility at the C6 level. Only late in the pilot investigation did we begin having the patient flex the neck, and those were the very subjects that showed improved test-retest reliability. In the present inquiry, all subjects fully flexed the neck while lying prone, which certainly seems to improve palpation of the lower cervical spine, relatively increasing the difficulty in locating the shorter C3 and C4 spinous processes. Others have proposed that midcervical palpation is most difficult as well, [20] and ultrasound has been used to image the spine, [21] as well as “indentation” testing (ie, not a quick pulsed thrust like the PulStar uses, but rather pressure applied at a rate of 2.5 mm/s until a load of 1 N is attained, by use of a flat, rigid, 3 × 3 cm surface) combined with ultrasound imaging to improve examiner palpatory reliability. [22] Whether ultrasonic or other imaging is needed to perform more reliable midcervical PulStar compliance measurements remains to be determined. It may be that PulStar used as a more global measure of cervical compliance (rather than differentiating exactly which 2 segments are less compliant) would provide clinically relevant information, even without palpatory determination of the exact locations of the C3 and C4 vertebrae.

We may also speculate that the amount of neck flexion in our subjects might have varied significantly from examination to examination, producing a confounding variable that affected reliability of the C3–4 measure. In this regard, although we are unaware of studies on the cervical spine, certainly there are recent observations on the lumbar spine by Caling and Lee [23] that suggest posteroanterior stiffness varies significantly with the direction of applied force. Further investigations might want to control for this variable by monitoring the degree of cervical flexion during PulStar testing by use of electrogoniometry; however, up to 9° of measurement error in flexion/extension may still be a confounder. [24] Researchers might also utilize an instrument that establishes a 90° perpendicular to the spine, to determine if that is a source of error.

It is worthy to note that in the present study rules for use of the instrument were more stringent than those used in clinical practice and might have actually led to underestimation of clinical reliability. Hence, while researchers generally agree that establishing clinical reliability is easier in an experimental setting following strict protocols than in a busy practice where compensation depends on volume of services and not necessarily quality of care rendered, in the present investigation doctors were not allowed to repeat their examination even if they thought the PulStar probe had slipped, was at an angle other than 90° perpendicular to the spinous, or because they had miscounted the level of spinous that they were checking. In each of these cases, a practicing clinician can do a reexamination to check the data; in contrast, since this was a blinded investigation, doctors were not allowed the opportunity to recheck their work. Further research of PulStar reliability should include the possibility of the clinician repeating his examination and suggesting which examination should be used (while still blinded from data collection). In this way, we might know whether repeat trials, such as would be available to the clinician in private practice, would enhance the reliability of the procedure.

Two experienced clinicians (as opposed to 1 novice clinician and 1 experienced clinician) might not have difficulty with interexaminer reliability using the PulStar in the midcervical spine. However, despite some evidence of fatigue for the novice examiner, whose variance between trials increased slightly (0.4 lb force from the 1st to the 18th subject), his overall rate of variance ranged only from 0.8 to 1.2 lb force, not dissimilar from the more constant variance rate of 1.0 lb force observed by the experienced examiner. From the standpoint of clinical reliability, this is a wash; despite some differences, it appears that for both examiners an error range of 1.0 lb force might be expected between any 2 trials. The present investigation then revealed no clinically meaningful difference in reliability between the experienced and novice, but trained, investigator.

Finally, using the EDA graphic analysis, we provide a preliminary view of normal spinal compliance using the PulStar instrument, which may guide further research aimed at developing norms for specific populations. It is noteworthy that averaged data on 18 subjects from all 4 trials (2 doctors × 2 trials) indicate that spinal compliance was greatest in the lower cervical and lumbar spines and lowest over the occiput (control site) and upper dorsal spine. Certainly, further research will be necessary to confirm and extend these preliminary observations, comparing normal populations to patients in pain, for example. Also, it should be understood that this report did not measure the validity of the differential compliance analysis, a computerized analysis which measures the difference in spinal compliance between vertebral segments and triggers the PulStar to provide more pulsed adjustments to areas of fixation or poor compliance. Only further research of trial validity can determine the significance and clinical meaningfulness of the computerized analysis and of the computer-guided PulStar adjustment itself.

It is uncommon, if not rare, to find either spinal fixation or chiropractic subluxation detection strategies that have a high degree of intraexaminer and interexaminer reliability, [25] and this has prompted some to suggest abandoning research in this area altogether. [3, 26–27] Since no individual or panel of chiropractic experts to date has been able to agree on an operational definition or so-called gold standard dependent variable to measure subluxation, [28–31] we here make the assumption that if there are subluxation-free spines, it is more likely that we will find them in younger, pain-free individuals, whose spines have not yet been subject to decades of postural and physical insults. While we concede that we cannot rule out the presence of VSC, purported to influence nerves and viscera, in the young college students in our trial (primarily because we do not yet know how to measure subluxation complex), if they did have these lesions they apparently did not adversely affect clinical reliability of the apparatus we tested. Of course, we will only learn whether the PulStar measure correlates with outcomes and whether it is capable of serving as a mediating variable of VSC (ie, becoming part of a gold standard for VSC diagnosis) if clinical research on trial validity of differential compliance is conducted. [1]

The results then of the present inquiry on the reliability of the PulStar instrument set to the analysis mode, a novel new chiropractic technology assessment utilizing the first patented computer-assisted device developed to measure spinal compliance and possibly fixation, are certainly promising and warrant further research. More research using different doctors and on larger numbers of subjects, including some with pain, would help determine the generalizability of these findings. Protocols we developed may also be used to conduct research aimed at establishing norms for different populations, including potentially patients with pain, obese individuals, and otherwise normal subjects, to verify and extend our initial findings on these healthy college students. Now that there is initial evidence of good to excellent clinical reliability of the PulStar spinal compliance measure, trials should also be designed and implemented to determine what this phenomenon means in terms of chiropractic patient care and outcomes (ie, trial and construct validity).

Conclusion

The PulStar mechanical adjusting device set to analysis mode appears to have good to excellent reliability when used by either an experienced or a novice (but trained) examiner. In addition, as a measure of spinal resistance to a light pulsed thrust or spinal compliance, reliability was similarly good to excellent between the 2 doctors using the PulStar instrument. Preliminary results indicate spinal compliance in normal subjects is greatest in the lower cervical and lumbar spines and lowest at the upper cervical and upper dorsal levels. This initial study does not address the validity or clinical significance of the measurement method. Further research will be necessary using greater numbers and a wider variety of subjects and more diverse examiners, to verify these findings and fully understand the generalizability of these results.

Return to INSTRUMENT ADJUSTING
Since 5-12-2004

Home Page

Visit Our Sponsors

Become a Sponsor

Join us

Please read our DISCLAIMER

PulStar Differential Compliance Spinal Instrument: A Randomized Interexaminer and Intraexaminer Reliability Study

Return to INSTRUMENT ADJUSTING

PulStar Differential Compliance Spinal Instrument:
A Randomized Interexaminer and Intraexaminer Reliability Study