FROM:
J Manipulative Physiol Ther 2003 (Oct); 26 (8): 493–501
Robert A Leach, DC, Patrick L Parker, Paul S Veal, DC
OBJECTIVE: To provide an entry-level, new technology reliability assessment of the PulStar computer-assisted, differential compliance spinal instrument.
SUBJECTS: Eighteen college students (9 male and 9 female) were recruited by announcements and personal contacts.
METHODS: Following approval of the consent process by the Institutional Review Board of Mississippi State University, a PulStar Function Recording and Analysis System (PulStarFRAS) device was evaluated for clinical reliability. Two examiners, blinded to data collection, used the instrument on individual subjects in random order (subjects lay prone with their backs exposed) to administer light impulses (approximately 0.9 J, which produced a 3- to 4-lb force) at each segmental level throughout the cervical, dorsal, and lumbar spine using probe tips spaced 3 cm apart, straddling the spinous processes, while a computer recorded the findings (resistance on a scale of 0 to 25.5 lb force). Data were analyzed by exploratory data analysis (EDA) with analysis of variance (ANOVA) testing and by use of the intraclass correlation coefficient (ICC). In addition, a mean test (ANOVA) was conducted to determine if a trend in variation occurred as a result of repeated light thrusts to the spine, independent of variance explained by different examiners.
RESULTS: Using EDA and ANOVA, intraexaminer reliability for the 2 practitioners was very high but not perfect. This was confirmed by ICC statistics demonstrating good to excellent reliability for both practitioners (0.89 for the experienced practitioner, 0.78 for the newly trained practitioner). Interexaminer reliability of PulStar was similarly very high but not perfect based on EDA and ANOVA, and good to excellent based on the ICC (0.87).
CONCLUSION: The PulStar mechanical adjusting device set to analysis mode appears to have good to excellent reliability when used by either an experienced or a novice (but trained) examiner. In addition, as a measure for resistance to a light thrust or spinal compliance, reliability was similarly good to excellent between the 2 doctors using the PulStar instrument.
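The ICC values reported above summarize how much of the total variation in the readings is attributable to real differences between measurement targets rather than to examiner or trial differences. The abstract does not state which ICC form was used, so the following is only a minimal sketch under the assumption of the two-way random-effects, absolute-agreement, single-measurement form, often written ICC(2,1). The function name `icc_2_1` and the readings are hypothetical and are not taken from the study.

```python
def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single measurement.

    ratings: list of rows, one per target (e.g. a subject or spinal level),
    each row holding one reading per examiner (or per trial).
    Assumption: this ICC form is illustrative; the study's form is not stated.
    """
    n = len(ratings)                      # number of targets
    k = len(ratings[0]) if n else 0       # number of examiners or trials
    if n < 2 or k < 2 or any(len(row) != k for row in ratings):
        raise ValueError("need a rectangular table with >= 2 rows and >= 2 columns")

    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares (no replication).
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Hypothetical resistance readings (lb force): 3 targets x 2 examiners,
# where the second examiner reads a constant 1 lb higher than the first.
example = [[4.0, 5.0], [6.0, 7.0], [8.0, 9.0]]
print(round(icc_2_1(example), 3))  # 0.889
```

Because this form measures absolute agreement, a constant offset between examiners lowers the coefficient even when the two examiners rank every target identically.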
From the Full-Text Article:
Introduction
Throughout chiropractic history, spinal lesions—
termed subluxation or, more recently, subluxation
complex—have typically been described as “tight”
or “taut fibers,” tender on palpation, that influence the
nervous system and impair health; restricted joint motion
has been a clinically relevant associated phenomenon as
well. [1] According to Sandoz, [2] the first chiropractic text published in 1906 makes note that, “A simple subluxated vertebra differs from a normal vertebra only in its field of motion. . . .” Despite at least 2 decades of initial chiropractic research into proposed mediating variables of the purported chiropractic vertebral subluxation complex (VSC), including measures of motion and fixation, current methods for
assessing common spinal problems are generally considered
to have only poor to marginal reliability. [3] More recent evidence indicates that at least 1 test of passive intersegmental mobility may have acceptable clinical reliability when conducted on symptomatic subjects but not when used on asymptomatic subjects. [4] For example, physical therapists
had acceptable κ (weighted [κw]) values (ie, acceptable
reliability was a κw value ≥ .40; poor reliability was a
κw value < .40) in 10 of 11 symptomatic subjects but had
only poor reliability between therapists in 34 of 35 asymptomatic individuals when assessing intersegmental motion
range between C2 and T5. [4]
Fischer [5] was the first to report a new device used to
measure soft tissue compliance, the Tissue Compliance
Meter (TCM) (Pain, Diagnostics and Thermography, Great
Neck, NY), which uses a 0.5-cm cylindrical rubber tipped
probe with a collar that remains at the surface of the skin
even as the probe is pushed into the paraspinal tissues. A
force gauge attached to the opposite end of the probe
measures the subject’s tolerance to pressure deformation.
When used on paraspinal surfaces, it is thought to measure
the displacement that occurs when force is applied to the
skin overlying the muscles and unknown variables, which
might include muscle tone, edema, and skin elasticity, parameters that might theoretically be associated with the
purported VSC. [1, 6] Jansen et al [7] were the first chiropractic
investigators to report on the device. Initial investigations
on 20 asymptomatic male and female subjects revealed that
26% of the paraspinal sites tested significantly different on
retest 10 minutes later, although better correlations were
found using 2 kg of force (r = 0.70 to 0.92 at L3) than 4 kg
of force (r = 0.12 to 0.52 at L3). [7] Others have questioned
the reliability of the TCM as well, and even with a much
smaller diameter disk, Kawchuk and Herzog, [6] after 10 trials
performed in random order by 5 examiners, found only poor
reliability using the TCM on 3 substrates and 1 control
surface. In contrast with these studies, Sanders and Lawson [8]
found adequate reliability and stability between 10-minute
intervals in 40 asymptomatic subjects, with only 5% of
normal subjects demonstrating significant left/right paraspinal differences. In addition, Waldorf et al [9] demonstrated
excellent interexaminer concordance (r values 0.90) on
50 male and 50 female asymptomatic subjects while lying in
the prone position and found test-retest temporal variability
after 15 minutes and 2-week intervals was low. Finally,
Nansel et al [10] used the TCM to determine whether lumbar
paraspinal muscle tone is altered by cervical spine adjusting; in a blinded randomized trial, they determined that
lower cervical diversified adjustments caused significantly
more relaxation bilaterally at the L4-5 level than was observed after upper cervical adjustments (P = .01). While
providing no direct evidence that the TCM measures the
elusive VSC, these researchers provided evidence that measures of paraspinal compliance may be affected by chiropractic adjustments.
Recently Evans, [11] Evans and Collins, [12] and Leach [13] reported on a novel device used to deliver a light thrust into
the paraspinal tissues and measure the resistance, or spinal
compliance, in an effort to identify areas of the spine where
“fixation” might indicate VSC and effect improved clinical
outcomes. Because the instrument — interfaced with a computer — measures and compares resistance from 1 spinal
segment to the next, Evans [11] terms this differential compliance. Unlike the prior measures of tissue compliance using
the TCM, which relied on subjects verbally identifying
when tolerance to pressure had been attained, PulStar measures and compares resistance to a light thrust between
consecutive spinal segments. Evans et al [14] proposed that
areas of the spine that exhibit increased resistance to a light
pulsed thrust might indicate the presence of inflammation,
muscle spasm, and/or joint fixation and serve as a mediating
variable for VSC. Moreover, they speculate that since the
PulStar uses a higher velocity low-amplitude thrust — unlike
the TCM which is used to slowly apply greater pressure to
examine paraspinal muscles by regions — it may identify
intersegmental fixations by identifying significant variance
in resistance between consecutive motion segments. However, before subluxation detection strategies and measures
of spinal dysfunction can be evaluated for clinical validity
and utility, researchers must first develop and identify the
most clinically reliable methods for evaluation of spinal
function. [3, 15] Computer-aided instruments have the potential
to more reliably identify spinal lesions that chiropractors
treat, and transducers might arguably be expected to more
accurately assess spinal motion restriction and compliance
than would manual procedures done by hand and using
instruments such as the TCM.
With this in mind, a pilot intraexaminer reliability investigation was completed in April 2001 on a convenience
sample of 15 subjects to determine effect size necessary to
perform the present randomized intraexaminer and interexaminer reliability study. As expected based on preliminary
observations, intraexaminer reliability was best at the C2
spinal level and worst at the C6 level (perhaps owing to
easier palpation of the former segment). This preliminary
work indicated that at least minimal examiner training was
important and that the best protocol included having subjects lying prone with the head slightly flexed (perhaps 25°).
It was found that the instrument’s probes must be held flat
prior to discharge, that a gentle downward pressure was
necessary to permit the instrument to make a more accurate
and repeatable preload measurement and discharge, and that
retest repeatability was best when the shaft of the instrument
was held perpendicular (90°) to the skin’s surface, with the
probes equidistant from the spinous process of the vertebral
motion segment being tested. Finally, our unpublished pilot
study findings seemed to confirm the findings of other
PulStar users, who observed that better relaxation and less
variability are seen when the clinician first demonstrates the
instrument on the subject’s hand, prior to making measurements on the spine with the patient lying prone. The finding
that subjects should lie prone for improved interexaminer
and intraexaminer reliability was also suggested by another
pilot investigation which is being submitted elsewhere. [16] Based on findings of the 2 pilot studies, it was determined that 20 normal subjects (10 male subjects, 10 female subjects) would be suitable to yield satisfactory preliminary reliability data for the PulStar, with a minimum of 2 blinded practitioners each randomly performing 2 examinations on each subject.
The purpose of the present investigation was to extend
and expand on the pilot studies and perform a randomized, blinded, new technology assessment of the clinical reliability of a computer-assisted chiropractic analytic device that
measures spinal compliance, a measure of resistance to a
light pulsed paraspinal thrust. The primary aim then was to
determine whether 2 chiropractic doctors could find acceptable agreements, while the PulStar was set to the analysis
(ie, measurement) mode. A secondary aim was to provide a
preliminary baseline of measures of normal spinal compliance for further research.
Discussion
Of all the differential spinal compliance measures made with the PulStar unit, significant variance between 2 examinations by the same doctor occurred only at the occiput and C3 and only for the novice examiner, while significant variance between examiners occurred only at occiput and C4. The measurement at occiput involved placing the probes over the occiput in such a manner that they straddled the external occipital protuberance. It is quite possible that despite stabilizing the probes by resting them against the side of the first finger of the examiner's free hand, some sliding of the probes during discharge might have created unacceptable reliability at this level. Others using PulStar place the probes directly over the atlanto-occipital joints for the occipital measurement, and this may prove to be a more satisfactory arrangement. Further research will be needed to verify this finding and to determine whether an occipital site is even necessary. Obviously, no other vertebral segment that we tested posed the unique anatomy found at the occiput.
We were only mildly surprised to find poorer agreement at the C3 and C4 levels, after the results of our earlier pilot investigation had revealed poorest reproducibility at the C6 level. Only late in the pilot investigation did we begin having the patient flex the neck, and those were the very subjects that showed improved test-retest reliability. In the present inquiry, all subjects fully flexed the neck while lying prone, which certainly seems to improve palpation of the lower cervical spine, relatively increasing the difficulty in locating the shorter C3 and C4 spinous processes. Others have proposed that midcervical palpation is most difficult as well, [20] and ultrasound has been used to image the spine, [21] as well as “indentation” testing (ie, not a quick pulsed thrust like the PulStar uses, but rather pressure applied at a rate of 2.5 mm/s until a load of 1 N is attained, by use of a flat, rigid, 3 × 3 cm surface) combined with ultrasound imaging to improve examiner palpatory reliability. [22] Whether ultrasonic or other imaging is needed to perform more reliable midcervical PulStar compliance measurements remains to be determined. It may be that PulStar used as a more global measure of cervical compliance (rather than differentiating exactly which 2 segments are less compliant) would provide clinically relevant information, even without palpatory determination of the exact locations of the C3 and C4 vertebrae.
We may also speculate that the amount of neck flexion in our subjects might have varied significantly from examination to examination, producing a confounding variable that affected reliability of the C3–4 measure. In this regard, although we are unaware of studies on the cervical spine, certainly there are recent observations on the lumbar spine by Caling and Lee [23] that suggest posteroanterior stiffness varies significantly with the direction of applied force. Further investigations might want to control for this variable by monitoring the degree of cervical flexion during PulStar testing by use of electrogoniometry; however, up to 9° of measurement error in flexion/extension may still be a confounder. [24] Researchers might also utilize an instrument that establishes a 90° perpendicular to the spine, to determine if that is a source of error.
It is worth noting that in the present study the rules for use of the instrument were more stringent than those used in clinical practice and might actually have led to underestimation of clinical reliability. Researchers generally agree that establishing clinical reliability is easier in an experimental setting following strict protocols than in a busy practice where compensation depends on volume of services and not necessarily on quality of care rendered. In the present investigation, however, doctors were not allowed to repeat an examination even if they thought the PulStar probe had slipped, had been held at an angle other than perpendicular (90°) to the spinous process, or had been placed at a miscounted spinous level. In each of these cases, a practicing clinician can do a reexamination to check the data; in contrast, since this was a blinded investigation, doctors were not allowed the opportunity to recheck their work. Further research on PulStar reliability should include the possibility of the clinician repeating the examination and indicating which examination should be used (while still blinded to data collection). In this way, we might know whether repeat trials, such as would be available to the clinician in private practice, would enhance the reliability of the procedure.
Two experienced clinicians (as opposed to 1 novice clinician and 1 experienced clinician) might not have difficulty with interexaminer reliability using the PulStar in the midcervical spine. However, despite some evidence of fatigue for the novice examiner, whose variance between trials increased slightly (0.4 lb force from the 1st to the 18th subject), his overall rate of variance ranged only from 0.8 to 1.2 lb force, not dissimilar from the more constant variance rate of 1.0 lb force observed for the experienced examiner. From the standpoint of clinical reliability, the difference is negligible; it appears that for both examiners an error range of 1.0 lb force might be expected between any 2 trials. The present investigation then revealed no clinically meaningful difference in reliability between the experienced and novice, but trained, investigator.
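The fatigue question discussed above comes down to whether the disagreement between an examiner's 2 trials drifts upward over the order in which subjects were tested. The study used a mean test (ANOVA) for this; the sketch below swaps in a simpler, purely illustrative least-squares slope of the absolute between-trial difference against test order. The function names and all readings are hypothetical.

```python
def abs_differences(trial_1, trial_2):
    """Absolute difference between paired readings from two trials."""
    if len(trial_1) != len(trial_2):
        raise ValueError("trials must contain the same number of readings")
    return [abs(a - b) for a, b in zip(trial_1, trial_2)]


def trend_slope(values):
    """Least-squares slope of values against their 1-based test order."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least 2 values to estimate a slope")
    x_mean = (n + 1) / 2
    y_mean = sum(values) / n
    sxy = sum((i - x_mean) * (y - y_mean) for i, y in enumerate(values, start=1))
    sxx = sum((i - x_mean) ** 2 for i in range(1, n + 1))
    return sxy / sxx


# Hypothetical mean readings (lb force) for 4 subjects in test order.
first_exam = [10.0, 12.0, 9.0, 11.0]
second_exam = [10.5, 11.0, 10.5, 13.0]

diffs = abs_differences(first_exam, second_exam)
print(diffs)               # [0.5, 1.0, 1.5, 2.0]
print(trend_slope(diffs))  # 0.5 (lb force per additional subject)
```

A slope near zero would correspond to the constant variance described for the experienced examiner, while a small positive slope would correspond to the slight increase described for the novice.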
Finally, using the EDA graphic analysis, we provide a preliminary view of normal spinal compliance using the PulStar instrument, which may guide further research aimed at developing norms for specific populations. It is noteworthy that averaged data on 18 subjects from all 4 trials (2 doctors × 2 trials) indicate that spinal compliance was greatest in the lower cervical and lumbar spines and lowest over the occiput (control site) and upper dorsal spine. Certainly, further research will be necessary to confirm and extend these preliminary observations, comparing normal populations to patients in pain, for example. Also, it should be understood that this report did not measure the validity of the differential compliance analysis, a computerized analysis which measures the difference in spinal compliance between vertebral segments and triggers the PulStar to provide more pulsed adjustments to areas of fixation or poor compliance. Only further research of trial validity can determine the significance and clinical meaningfulness of the computerized analysis and of the computer-guided PulStar adjustment itself.
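The differential compliance idea described above, comparing resistance from one vertebral segment to the next, can be illustrated with a few lines of code. This is a sketch only: the PulStar's actual algorithm and thresholds are not described in this report, so the function names, the 2.0 lb force threshold, and the readings below are all hypothetical.

```python
def segment_differences(readings):
    """Difference in resistance between each pair of consecutive segments.

    readings: list of (level, resistance_lbf) tuples in cranial-to-caudal order.
    Returns a list of ("upper-lower", difference) tuples.
    """
    return [
        (f"{upper}-{lower}", lower_value - upper_value)
        for (upper, upper_value), (lower, lower_value) in zip(readings, readings[1:])
    ]


def flag_large_differences(differences, threshold_lbf):
    """Labels of adjacent pairs whose absolute difference exceeds the threshold.

    threshold_lbf is an illustrative cutoff, not a value used by the device.
    """
    return [label for label, diff in differences if abs(diff) > threshold_lbf]


# Hypothetical resistance readings (lb force) at three cervical levels.
readings = [("C2", 10.0), ("C3", 10.5), ("C4", 13.0)]

diffs = segment_differences(readings)
print(diffs)                               # [('C2-C3', 0.5), ('C3-C4', 2.5)]
print(flag_large_differences(diffs, 2.0))  # ['C3-C4']
```

Any clinically meaningful cutoff would have to come from the validity research that the authors call for, since between-trial differences of about 1.0 lb force were observed even in these healthy subjects.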
It is uncommon, if not rare, to find either spinal fixation or chiropractic subluxation detection strategies that have a high degree of intraexaminer and interexaminer reliability, [25] and this has prompted some to suggest abandoning research in this area altogether. [3, 26–27] Since no individual or panel of chiropractic experts to date has been able to agree on an operational definition or so-called gold standard dependent variable to measure subluxation, [28–31] we here make the assumption that if there are subluxation-free spines, it is more likely that we will find them in younger, pain-free individuals, whose spines have not yet been subject to decades of postural and physical insults. While we concede that we cannot rule out the presence of VSC, purported to influence nerves and viscera, in the young college students in our trial (primarily because we do not yet know how to measure subluxation complex), if they did have these lesions they apparently did not adversely affect clinical reliability of the apparatus we tested. Of course, we will only learn whether the PulStar measure correlates with outcomes and whether it is capable of serving as a mediating variable of VSC (ie, becoming part of a gold standard for VSC diagnosis) if clinical research on trial validity of differential compliance is conducted. [1]
The results of the present inquiry into the reliability of the PulStar instrument set to the analysis mode, a new chiropractic technology assessment utilizing the first patented computer-assisted device developed to measure spinal compliance and possibly fixation, are certainly promising and warrant further research. More research using different doctors and on larger numbers of subjects, including some with pain, would help determine the generalizability of these findings. Protocols we developed may also be used to conduct research aimed at establishing norms for different populations, including potentially patients with pain, obese individuals, and otherwise normal subjects, to verify and extend our initial findings on these healthy college students. Now that there is initial evidence of good to excellent clinical reliability of the PulStar spinal compliance measure, trials should also be designed and implemented to determine what this phenomenon means in terms of chiropractic patient care and outcomes (ie, trial and construct validity).
Conclusion
The PulStar mechanical adjusting device set to analysis mode appears to have good to excellent reliability when used by either an experienced or a novice (but trained) examiner. In addition, as a measure of spinal resistance to a light pulsed thrust or spinal compliance, reliability was similarly good to excellent between the 2 doctors using the PulStar instrument. Preliminary results indicate spinal compliance in normal subjects is greatest in the lower cervical and lumbar spines and lowest at the upper cervical and upper dorsal levels. This initial study does not address the validity or clinical significance of the measurement method. Further research will be necessary using greater numbers and a wider variety of subjects and more diverse examiners, to verify these findings and fully understand the generalizability of these results.