Inga Wang1*, Rick Wickstrom2
1School of Rehabilitation Sciences & Technology, University of Wisconsin-Milwaukee, Milwaukee, WI, USA
2WorkAbility Systems, Inc. West Chester, OH, USA
*Correspondence author: Inga Wang, OTR/L, PhD, School of Rehabilitation Sciences & Technology, University of Wisconsin-Milwaukee, Enderis Hall 955, 2400 E Hartford Ave, Milwaukee, USA; Email: [email protected]
Published Date: 25-11-2024
Copyright© 2024 by Wang I, et al. All rights reserved. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Abstract
Objective: A movement screen is an efficient tool to evaluate movement patterns, detect potential dysfunctions and justify interventions aimed at improving musculoskeletal health. This study aimed to examine the reliability and validity of the 10-item Active Movement Scale (AMS) for assessing musculoskeletal function in adults.
Methods: This is a cross-sectional measurement study. Fifty-five subjects receiving outpatient orthopedic physical therapy attended two sessions. Concurrently, participants underwent self-reported questionnaires and performance-based tests. Inter-rater reliability, test-retest reliability, concurrent validity and floor/ceiling effect of the AMS were assessed using weighted Kappa, percentage of agreement, Intraclass Correlation Coefficient (ICC) and Pearson correlation.
Results: Inter-rater reliability was excellent (ICC range: 0.90 to 0.93). Test-retest reliability was good (ICC range: 0.82 to 0.87). Inter-rater agreement (weighted kappa) ranged from 0.57 to 0.92. Test-retest agreement (kappa) ranged from 0.53 to 0.77. There were no floor effects, but mild ceiling effects were observed. AMST correlated highly with Lifestyle Physical Function Screen (LPFS) (r = 0.73) and moderately with PROMIS Physical Function-10a (PFF) (r = 0.61) surveys. AMS (upper body subscale) had a low correlation with Grooved Pegboard Placing (GPP) (r = 0.49), WorkAbility Rate of Manipulation Placing (WRMP) (r = 0.51). AMS (lower body subscale) correlated moderately with 10-Meter Walk at Fast pace (GSF) (r = 0.52) and Two Square Agility Test (TSAT) (r = 0.43).
Conclusion: This study provides evidence of the reliability and validity of AMS in adults with musculoskeletal disorders and identifies potential gaps for future improvement and development.
Keywords: Motor Activity; Movement Disorders; Psychometrics; Disability Evaluation; Movement; Dyskinesias; Musculoskeletal Diseases
Abbreviation
AMS: Active Movement Scale; AMST: Active Movement Scale Total score; AMSU: Active Movement Scale Upper Body subscale; AMSL: Active Movement Scale Lower Body subscale; WtH: Waist-to-Height ratio; BMI: Body Mass Index; PFF: PROMIS Physical Function 10a Survey (PFF); LPFS: Lifestyle Physical Function Screen; CAL: Cardio Activity Level; TSAT: Two Square Agility Test; GSU: 10-Meter Walk at Usual pace; GSF: 10-Meter Walk at Fast pace ; GS: Grip Strength; GPP: Grooved Pegboard Placing test; WRMT: WorkAbility Rate of Manipulation Turning test; WRMP: WorkAbility Rate of Manipulation Placing test
Introduction
Prevalent musculoskeletal conditions include low back pain, neck pain, shoulder, knee disorders and arthritis, impose a significant burden on nonfatal health diseases [1-4]. Affected individuals often suffer chronic pain and disability, leading to participation restrictions in daily activities and decreased quality of life. This results in a considerable number of annual healthcare visits and substantial costs for musculoskeletal conditions, exceeding $1 trillion in 2018 [5]. This cost burden includes healthcare costs and productivity losses due to sick days, disability and reduced work efficiency. Notably, while individuals aged 65 and over incurred the highest mean per-person costs, persons aged 45-64 in the workforce accounted for the largest share of aggregate costs due to the large size of this cohort [5].
The American Physical Therapy Association (APTA) advocates for greater access to physical therapy professionals as a strategic approach to transform society by optimizing movement to improve the human experience. This involves leveraging knowledge and expertise in the movement system to optimize movement, promote health and wellness, mitigate the progression of impairments and prevent additional disability. An endorsed strategy involves the implementation of a functional movement screen in various settings to detect movement impairments to guide the selection of additional tests and measures to include in the patient and client examination.
Typically, a movement screening tool is an observed, performance-based assessment that possesses specific characteristics. Firstly, it comprises a series of functional tasks to concurrently assess joint mobility, stability and body asymmetry through predetermined movements. Secondly, the assessment aims to be simple, practical and cost-effective to allow for completion within five to ten minutes. Thirdly, the assessment must use movement patterns of the trunk, upper extremities and lower extremities that are essential to perform daily activities. Thirdly, the assessment relies on movement patterns of the trunk, upper extremities and lower extremities that are essential to perform daily activities. Fourthly, these tools are versatile, applicable across various populations and settings. Lastly, they are designed to identify motion deficits, serve as a pre-participation functional assessment and predict injury risks within the assessed population [6].
Numerous studies have substantiated the reliability of movement screening tools (Table 1) [7-25]. In the context of known-groups validity, studies demonstrated that individuals with a documented history of musculoskeletal injury exhibit lower scores on movement screens compared to those without such antecedents [26,27]. Nevertheless, establishing a definitive association between a score falling below a cut-off threshold and an elevated risk of injury remains ambiguous. This uncertainty is likely attributable to the multifaceted nature of injury causation and the inherent challenges associated with monitoring subsequent injury occurrences [7,28-30]. Currently, the development of movement screening tools that are not only reliable and valid but also sensitive to change and effective in predicting injury risk is still limited.
Developed by Wickstrom, et al., the Active Movement Scale (AMS) was designed for movement screening to improve physical examinations [31]. It establishes a graded observational rating system for individual movements, aiding in the triage of individuals for subsequent fitness or therapeutic interventions. The initial version, AMS version 1 (v1), comprises 14 items and could be administered in about five minutes. After field testing, four items were removed and the original five-point rating scale (0 = unable/unwilling to attempt; 1 = poor; 2 = fair; 3 = guarded; 4 = normal) was reconfigured to a four-point scale (0 to 3) by merging the ‘fair’ and ‘guarded’ categories. This modification was made as clinicians identified that these two categories often represented similar levels of movement quality, making it challenging to reliably differentiate between them during assessments. The consolidation aimed to enhance consistency in scoring and improve the scale’s overall applicability in clinical practice. The revised AMS, known as version 2 (v2), consists of 10 items with a 4-point scale. This study aimed to evaluate the reliability and validity of AMS v2 at both the scale and item levels. Specifically, assessments encompass inter-rater reliability, test-retest reliability, concurrent validity and floor/ceiling effect of the AMS for evaluating musculoskeletal function in adults.
Material and Methods
Study Sample
A convenience sample of adults diagnosed with musculoskeletal disorders was prospectively recruited within a two-week period post-discharge from an outpatient physical therapy clinic in Newark, OH, from January 2021 to January 2023. Inclusion criteria were: (1) age 18 or older, (2) presence of upper or lower body musculoskeletal ailment persisting for more than two weeks, (3) independent ambulation capacity with or without assistive aids, (4) capability to perform a functional pinch grasp to pick up a pencil and (5) self-reported pain at rest rated at 6/10 or below. Exclusion criteria were: (1) recent musculoskeletal surgery within the past three months, (2) current pregnancy and (3) inability to comprehend written or spoken instructions in English.
This prospective cohort study utilized an adapted COSMIN checklist to assess the methodological rigor of health status measurements [32]. Approval for the study was granted by the Institutional Review Board of the University of Wisconsin-Milwaukee. All participants completed written informed consent before study participation.
Test Protocol
Participants attended two sessions within a 3-week timeframe. During the initial session, participants provided demographic information and underwent separate AMS assessments administered by two evaluators: an experienced Physical Therapist (PT) (rater 1) with over 30 years of clinical expertise and a Physical Therapy Assistant (PTA) (rater 2) with over 20 years of clinical experience. The sequence of evaluators was randomized using an Excel-generated random list. Both assessors received in-person training from the developer, reviewed manuals and engaged in collaborative practice.
In the second session, in addition to a repeated AMS assessment by rater 2 (PTA), participants underwent anthropometric measurements (waist girth, height and weight). Additionally, they completed self-report surveys and performance-based evaluations of physical function. The self-report surveys included the PROMIS Physical Function 10a Survey (PFF), Lifestyle Physical Function Screen (LPFS) and Cardio Activity Level (CAL). Performance-based assessments were conducted in a randomized sequence and encompassed the Two Square Agility Test (TSAT), 10-Meter Walk at Usual (GSU) and Fast (GSF) pace, Grip Strength (GS), Grooved Pegboard Placing (GPP) and WorkAbility Rate of Manipulation Turning (WRMT) and Placing (WRMP) tests.
Instruments and Variables
Anthropometric variables: Waist-to-height ratio (WtH) was computed by dividing the waist circumference (cm) by the height (cm). Body Mass Index (BMI) was determined by dividing an individual’s weight in kilograms by the square of their height in meters.
Active Movement Scale (AMS): The AMS v2 is a 10-item screen for quantifying musculoskeletal risks, triaging physical activity or classifying clients for interventions [31]. Each item is rated on a 4-point ordinal scale (0 to 3). Upper extremity tasks are seated, while lower extremity tasks are standing. Movement tasks include: (1) Extend wrists, (2) Flex elbows back, (3) Elevate shoulders, (4) Rotate head down, (5) Extend head back, (6) Forward bend over, (7) Rotate torso, (8) Step up and over, (9) Active squat down and (10) Lunge backwards. Items 1, 2, 3, 4, 7, 8 and 10 are evaluated independently for the left and right sides and the lower score between the two sides is used to calculate the total score (AMST) and two sub-scores: upper body (AMSU) and lower body (AMSL). AMST is the cumulative sum, AMSU is items 1-5 and AMSL is items 6-10. Scores were converted to a percent (0-100%), with higher scores indicating better movement function. Item descriptions and detailed administration instructions are in Appendix 1.
PROMIS Physical Function 10a Survey (PPF): PPF is a self-reported assessment consisting of 10 items that gauge an individual’s current physical function status, such as limitations in walking over a mile [33]. Raw scores were obtained by summing item points and transformed into T-scores with a mean of 50 and a standard deviation of 10, where a higher T-score signifies enhanced physical function.
Lifestyle Physical Function Screen (LPFS): LPFS is a tailored 20-item self-reported physical function survey. LPFS integrates 11 items from PROMIS Item Bank v2.0 – Physical Function and 9 items co-developed by the authors with the PROMIS team. Responses employ a 5-point ordinal scale, ranging from “no difficulty” to “cannot do”. Aggregate LPF scores were transformed into a percentage scale (0-100%), with higher scores reflecting superior physical function.
Cardio Activity Level (CAL): CAL is a 10-level rating scale designed to categorize physical activity using levels that are consistent with categories of recommended physical activity for moderate or vigorous aerobic activity [34,35].
Two Square Agility Test (TSAT): TSAT is a functional mobility assessment designed to evaluate participants’ agility. In this test, individuals step back and forth across a marked line as quickly as safety permits for a designated number of cycles. TSAT speed (cycles per second), was determined by dividing the fixed number of cycles (5) by the best (lowest) completion time recorded across three trials [35].
10-Meter Walk at Usual (GSU) and Fast (GSF) pace: 10MWT evaluates the walking gait speed (meters per second) and was computed by dividing the measured walk distance by the best (lowest) time recorded across trials [10]. Participants undergo walking assessments in two distinct modes, yielding measures denoted as usual pace (GSU) and fast pace (GSF).
Grip Strength (GS): GS was assessed using a Jamar dynamometer with a standardized protocol at position 2 handle span with arm against the side of the trunk and elbow bent at 90°C [37]. Three trials with collected for the right and left hands by alternating between sides. The study used the average of the best result (highest) for the right and left hands.
Grooved Pegboard Placing (GPP): GPP evaluates finger dexterity of placing and removing pegs using the Lafayette Instrument Company’s Grooved Pegboard [38]. The study used the best result (lowest time) from each hand’s two trials, averaged for both right and left hands. Time was then transformed into dexterity speed (pegs/minute) using the formula: Speed = 25 pegs/Time (sec) * 60 sec/minute.
WorkAbility Rate of Manipulation Turning (WRMT) and Placing (WRMP): WRMT and WRMP assess hand dexterity of turning and placing 20 disks [39]. The study calculated the average of the best (fastest) three trials for each hand and converted this time to dexterity speed (disks per minute) using the formula: Dexterity Speed = 20 disks/Time (sec) * 60 sec/minute. The conversion of the best trial times to speed in disks per minute was undertaken to facilitate score interpretation, where elevated scores denote superior performance.
Statistical Analyses
Data were analyzed using SPSS statistical package version 29 (Armonk, NY: IBM Corp). Descriptive statistics, including mean + SD and frequency counts, were utilized for data depiction. Inter-rater and test-retest reliability for AMST, AMSU and AMSL scores were assessed at the item level using intraclass correlation coefficient (ICC) (model 2,1). ICC reliability values were categorized as poor (less than 0.5), moderate (0.5 to less than 0.75), good (0.75 to less than 0.9) and excellent (0.9 to 1.00).40 Individual AMS item agreement was evaluated using percentage of agreement and weighted Kappa, with guidelines indicating slight agreement (0.0 to 0.2), fair agreement (0.2 to 0.4), moderate agreement (0.4 to 0.6), substantial agreement (0.6 to 0.8) and almost perfect agreement (0.8 to 1.0).41 Concurrent validity was assessed through Pearson correlations between AMST, AMSU and AMSL scores and 1) anthropometric variables, 2) self-report surveys and 3) performance-based physical functioning tests. The strength of correlations was categorized as negligible (less than 0.3), low (0.30 to less than 0.5), moderate (0.5 to less than 0.7), high (0.7 to less than 0.9) and very high (0.9 to 1.0) [42]. Floor and ceiling effects were determined by calculating the proportion of individuals exhibiting the worst (floor) and best (ceiling) scores for AMST, AMSU and AMSL.
Results
A total of 55 participants (mean age: 61.3 ± 15.3 years, 64% female) attended two sessions spaced 8.9 ± 4.1 days apart (range: 3 to 21 days). Regarding the injured body part, 27 (49.1%) reported musculoskeletal impairments in the upper body, while 45 (81.8%) reported conditions in the lower body. Of these, 21 (38.2%) patients came to the clinic for back pain, 13 (23.6) due to joint injury and replacement, 10 (18.2%) for trauma injuries and 8 (14.5%) for arthritis. The mean scores of AMST, AMSU and AMSL in the first session were 80.1 (11.0), 85.0 (11.8) and 75.3 (14.9) respectively and in the second session, they were 78.4 (12.4), 82.4 (12.9) and 74.3 (17.1).
The overall inter-rater reliability of AMST, AMSU and AMSL subscales was good to excellent, with ICC values of 0.92 [95% CI: (0.86, 0.96)], 0.90 [95% CI: (0.79, 0.95)] and 0.93 [95% CI: (0.88, 0.96)], respectively. The test-retest reliability of the AMST, AMSU and AMSL subscales was good, with ICC values of 0.87 [95% CI: (0.77, 0.92)], 0.82 [95% CI: (0.71, 0.89)] and 0.85 [95% CI: (0.75, 0.91)], respectively.
Table 2 presents inter-rater reliability of individual movement tasks assessed by the AMS. Inter-rater reliability for individual items exhibited a range from 0.57 (rotate head down) to 0.92 (flex elbow back), with absolute agreement spanning 70.9% to 96.4%, indicating moderate to almost perfect agreement. Table 3 presents test-retest reliability of individual movement tasks assessed by the AMS. Test-retest reliability for individual items varied from 0.53 (step up and over) to 0.77 (flex elbow back, extend head back), with absolute agreement ranging from 69.1% to 90.9%.
Table 4 illustrates the concurrent validity of AMS. AMST exhibited a high correlation with AMSU (r = 0.80) and AMSL (r = 0.88), while AMSU had a low correlation with AMSL (r = 0.43). The AMSL displayed a negative low correlation with anthropometric variables: WtH (r = -0.35) and BMI (r = -0.40). AMSU was not significantly correlated with anthropometric variables for WtH (r=-.10) and BMI (r=-0.05). Concerning self-report surveys, AMST showed a moderate correlation with PFF (r = 0.61), a high correlation with LPFS (r = 0.73) and a low correlation with CAL (r = 0.40). Regarding performance-based physical functioning tests, AMSU exhibited a moderate correlation with GPP (r = 0.49) and WRMP (r = 0.51), but lower correlations with GS (r = 0.20) and WRMT (r = 0.42). AMSL displayed a moderate correlation with GSF (r = 0.52) and lower correlations with TSAT (r = 0.43) and GSU (r = 0.45).
No floor effects were observed for the AMST, AMSU and AMSL scales. However, mild ceiling effects were noted. In the initial session, 1 participant (1.8%), 9 participants (16.4%) and 3 participants (5.5%) attained the maximum score for the AMST, AMSU and AMSL scales, respectively. In the subsequent session, 2 participants (3.6%), 11 participants (20.0%) and 9 participants (16.4%) achieved the maximum score for the AMST, AMSU and AMSL scales, respectively.
Author (Year) | Study Sample | Measure / Sample Size | Number of raters | Time intervals between sessions | Results |
Sanchez-Lastra (2022) 8 | Adults from a residential and day care centre over 18 years of age, diagnosed with intellectual disability | FMS n = 30 | 3 raters | 14 days | Composite score: · Test-retest reliability (ICC) = 0.89 for trained assessor · Test-retest reliability (ICC): 0.60 to 0.76 for novice assessor · Inter-rater reliability (ICC): 0.71 to 0.80 |
Alkhathami (2021) 9 | Young adults between 18 and 40 years of age with low back pain (LBP) | FMS n = 22 | 4 raters | 6 weeks | Composite score: · Inter-rater reliability = 0.96 to 0.99 · Intra-rater reliability = 0.97 to 0.98 · Minimal Detectable Change (MDC95) = 0.90 to 1.05 |
Harper (2021) 10 | Healthy adults without restrictions to physical activity | FMS n = 16 | 5 raters | 4 days | Composite Score: · Inter-rater reliability (composite score) = ICC 0.79 to 0.84 Individual movement scores: · Inter-rater reliability (item-level) = Kappa -0.03 to 0.96 · Poor reliability for hurdle step, inline lunge and rotator stability. |
Smith (2013) 11 | Healthy, injury-free and physically active adults (aged 22 to 44) | FMS n = 20 | 4 raters | 1 week | Composite score: · Inter-rater reliability) (ICC) = 0.87 to 0.89 (good) Individual movement scores: · Inter-rater reliability (ICC) = 0.30 to 0.98, poor for hurdle step. · Intra-rater reliability (ICC): 0.81 to 0.91 |
Shultz (2013) 12 | National Collegiate Athletic Association Division IA varsity athletes | FMS n = 21 | 6 raters | 1 week | Composite score: · Test-retest reliability Live Session (ICC) = 0.60 (good) · Live-Versus-Video Session Reliability (ICC) = 0.92 (excellent) · Inter-rater reliability (Krippendorff α) = 0.38 (poor) Individual movement scores: · Interrater reliability: 0.25 to 0.95, mean 0.47 with poor reliability for in-line lunge, trunk stability, rotatory stability and overhead squat |
Onate (2012) 13 | Convenient healthy adults from the local University and surrounding area | FMS n = 19 | 2 raters | 1 week | Inter-session reliability (ICC) = 0.92 Inter-rater reliability (ICC) = 0.98 Inter-session agreement (kappa) = 0.16 to 1.00 (mean 0.55), with hurdle step poor Inter-rater agreement (kappa) = 0.31 to 1.00, mean 0.82, with hurdle step poor |
Gribble (2013) 14 | Convenient healthy adults from the university’s athletic training clinical staff and academic programs | FMS n = 3 | 38 raters (watch videos) | 1 week | Composite score: · Intra-rater reliability (all) (ICC) = 0.75 (moderate) · Intra-rater reliability (athletic training students) (ICC) = 0.37 · Intra-rater reliability (athletic trainer) (ICC) = 0.76 (moderate) · Intra-rater reliability (ATs with at least 6 months of experience) (ICC) = 0.95 (excellent) |
Teyhen (2012) 15 | Active-duty service members | FMS n = 64 | 2 raters | 48 to 72 hours | Composite score: · Test-retest reliability (ICC) = 0.76 (moderate) · Inter-rater reliability (ICC) = 0.74 (moderate) · Minimal detectable change (MDC95) = 2.1 to 2.5 points (on the 21-point scale) Individual movements: · Inter-rater agreement (kappa) = 0.45 to 0.82, mean 0.69 · Intra-rater agreement (kappa = 0.29 to 0.76, mean 0.61, rotary stability least reliable |
Morgan (2023) 16 | Healthy, active physical therapy students | FMS n = 45 | 4 raters (watch videos) | n/a | Composite Score: · Inter-rater reliability (ICC) = 0.95 · Individual movements: Inter-rater reliability (ICC) = 0.78 to 0.96 |
Minick (2010) 18 | Healthy college students | FMS n = 40 | 4 raters (watch videos) | n/a | Individual movement scores: · Inter-rater agreement (kappa) = 0.74 to 1.00, mean 0.92 |
Aghi (2023) 19 | Healthy young athletes | SFMA n = 28 | 2 raters (watch videos) | 60 days | Individual movement scores: · Intra-rater reliability (kappa) = 0.49 (moderate) · Inter-rater reliability (kappa) = 0.60 and 0.62 (substantial) |
Stanek (2019) 20 | Physically active participants | SFMA n = 25 | 3 raters | 2 to 7 days | Individual movement scores: · Intra-rater reliability (kappa) = 0.21 to 1.00 · Intra-rater reliability (% agreement) = 0.64 to 1.00 · Inter-rater reliability (kappa) = 0.11 to 0.89 · Inter-rater reliability (% agreement) = 0.52 to 0.96 |
Glaws (2014) 21 | Healthy students from physical therapy and the Ohio State University Club Rugby team | SFMA n = 35 | 3 raters (watch videos) | 7 to 14 days | Composite Score: · Intra‐rater reliability (ICC) = 0.26 to 0.52 · Inter‐rater reliability (ICC) = 0.43 Individual movement scores: · Intra‐rater reliability (kappa) = 0.72 to 0.83 · Intra‐rater reliability ((% agreement) = 0.85 to 0.91 · Inter‐rater reliability (kappa) = 0.41 to 0.69 · Inter‐rater reliability (% agreement) = 0.61 to 0.79 |
Dolbeer (2017) 22 | Subjects with a current history of axial, lower quarter or upper quarter pain of greater than two weeks | SFMA n = 49 | 3 raters (watch videos) | n/a | Composite Score: · Inter-rater reliability (ICC) = 0.61 Individual movement scores: · Inter-rater reliability (kappa) = 0.07 to 0.85 · Inter-rater reliability (% agreement) = 0.57 to 0.96 |
Harper (2023) 23 | Healthy volunteers | SFMA n = 25 | 2 raters | n/a | Composite Score: · Inter-rater reliability (ICC) = 0.55 to 0.78 Individual movement scores: · Inter-rater reliability (kappa) = 0.01 to 0.70 |
Holm (2008) 24 | Patients with rheumatoid arthritis (RA) | KFT n = 20 | 2 raters | 1 week | Composite Score: · Intra-rater reliability (ICC) = 0.90 to 0.95 · Inter-rater reliability (ICC) = 0.92 to 0.99 |
Kierkegaard (2023) 25 | Swedish military personnel | MSP n = 35 | n/a | 1 week | Individual movement scores: · Test-retest reliability (kappa) = 0.40 to 0.75 for items pertaining physical complaints/injuries · Test-retest reliability (kappa) = 0.72 to 1.00 for items pertaining physical performance, physical activity and exercise, eating and tobacco habits, sleep and perceived health · Test-retest reliability (kappa) = 0.47 to 0.65 for items for feeling mentally or physically prepared |
* ICC = Intraclass correlation coefficient; FMS = Functional Movement Screen; SFMA = Selective Functional Movement Assessment; KFT = Keitel Functional Test; MSP = Musculoskeletal Screening Protocol |
Table 1: Previous studies examining reliability of functional movement screen.
Inter-rater reliability (rater 1 vs. 2) | ||
Weighted Kappa | Agree (%) | |
Extend wrists | 0.75 | 89.1 |
Flex elbow back | 0.92 | 96.4 |
Elevate shoulders | 0.83 | 89.1 |
Rotate head down | 0.57 | 70.9 |
Extend head back | 0.73 | 85.5 |
Forward bend over | 0.89 | 94.5 |
Rotate torso | 0.64 | 76.4 |
Step up and over | 0.70 | 81.8 |
Active squat down | 0.91 | 94.5 |
Lunge backward | 0.84 | 83.6 |
Mean | 0.78 | 0.86 |
SD | 0.12 | 0.08 |
Table 2: Inter-rater reliability of the categorical scoring of AMS individual movement tasks presented as Kappa coefficient and agreement (%).
Test-retest reliability (rater 2) | ||
Weighted Kappa | Agree | |
Extend wrists | 0.67 | 87.3 |
Flex elbows back | 0.77 | 90.9 |
Elevate shoulders | 0.70 | 80.0 |
Rotate head down | 0.55 | 70.9 |
Extend head back | 0.77 | 89.1 |
Forward bend over | 0.75 | 87.3 |
Rotate torso | 0.54 | 70.9 |
Step up and over | 0.53 | 72.7 |
Deep squat down | 0.62 | 76.4 |
Lunge backward | 0.73 | 69.1 |
Mean | 0.66 | 0.80 |
SD | 0.10 | 0.09 |
Table 3: Test-retest reliability of the categorical scoring of AMS individual movement tasks presented as Kappa coefficient and agreement (%).
AMST | AMSU | AMSL | WtH | BMI | PFF | LPFS | CAL | TSAT | GSU | GSF | GS | GPP | WRMT | |
AMST | 1 | |||||||||||||
AMSU | 0.80** | 1 | ||||||||||||
AMSL | 0.88** | 0.43** | 1 | |||||||||||
WtH | -0.28* | -0.10 | -0.35** | 1 | ||||||||||
BMI | -0.29* | -0.05 | -0.40** | 0.87** | 1 | |||||||||
PFF | 0.61** | 0.40** | 0.61** | -0.12 | -0.19 | 1 | ||||||||
LPFS | 0.73** | 0.50** | 0.71** | -0.12 | -0.15 | 0.88** | 1 | |||||||
CAL | 0.40** | 0.29* | 0.38** | -0.11 | -0.08 | 0.55** | 0.56** | 1 | ||||||
TSAT | 0.42** | 0.26 | 0.43** | -0.24 | -0.25 | 0.37** | 0.42** | 0.18 | 1 | |||||
GSU | 0.43** | 0.25 | 0.45** | -0.16 | -0.28* | 0.32* | 0.32* | 0.28* | 0.50** | 1 | ||||
GSF | 0.47** | 0.25 | 0.52** | -0.18 | -0.24 | 0.34* | 0.41** | 0.30* | 0.52** | 0.81** | 1 | |||
GS | 0.18 | 0.20 | 0.11 | -0.05 | 0.06 | 0.12 | 0.27* | 0.25 | 0.19 | 0.01 | 0.12 | 1 | ||
GPP | 0.58** | 0.49** | 0.48** | -0.32* | -0.23 | 0.44** | 0.46** | 0.15 | 0.50** | 0.24 | 0.31* | 0.25 | 1 | |
WRMT | 0.46** | 0.42** | 0.37** | -0.27* | -0.23 | 0.32* | 0.29* | 0.19 | 0.37** | 0.29* | 0.31* | 0.17 | 0.74** | 1 |
WRMP | 0.60** | 0.51** | 0.51** | -0.29* | -0.28* | 0.47** | 0.52** | 0.18 | 0.64** | 0.37** | 0.49** | 0.20 | 0.79** | 0.71** |
AMS = Active Movement Scale; AMSU = Active Movement Scale Upper; AMSL = Active Movement Scale Lower; WtH = Waist-to-height ratio; BMI = Body mass index; PFF = PROMIS Physical Function 10a Survey (PFF); LPFS = Lifestyle Physical Function Screen; CAL = Cardio Activity Level; TSAT = Two Square Agility Test; GSU = 10-Meter Walk at Usual pace; GSF = 10-Meter Walk at Fast pace; GS = Grip Strength; GPP = Grooved Pegboard Placing test; WRMT = WorkAbility Rate of Manipulation Turning test; WRMP = WorkAbility Rate of Manipulation Placing test. **. Correlation is significant at the 0.01 level (2-tailed). *. Correlation is significant at the 0.05 level (2-tailed). |
Table 4: Concurrent validity of AMS percent scores with physical function measures.
Discussion
This study explored the reliability and validity of AMS v2 in a clinical cohort with musculoskeletal disorders. The findings indicated that, overall, AMS v2 exhibits good inter-rater reliability and test-retest reliability, a slight ceiling effect, satisfactory concurrent validity and adequate construct validity. The APTA advocates for expanded access to physical therapists as an entry-point to healthcare, covering physical activity, wellness and health assessments. Concerns persist about limited Return On Investment (ROI) in traditional workplace wellness strategies that predominantly target cardiovascular risks, while often neglecting biometrics for musculoskeletal risks. Thonon, et al., reviewed the ROI of workplace-based prevention and intervention studies and reported fewer positive results for high quality studies, with only 12.3% of interventions targeting risks for musculoskeletal disorders [43]. Early utilization of the AMS presents a strategic opportunity for physical therapists to seamlessly integrate movement screening into wellness programs, empowering clinicians to prescribe cost-effective interventions that target significant movement deficits, contributing to improved movement health across the lifespan.
Movement screening tools have undergone extensive investigation across various populations, including school sports teams, specific occupational groups (e.g. fire-fighters, emergency medical responders, health center employees), military personnel and collegiate athletes [7,26-28,46-52]. Many reliability studies were conducted in healthy, physically active adults collegiate athletes and active-duty service members with fewer studies focusing on individuals with musculoskeletal impairments [7-18]. The strengths of this study included recruiting patients seeking orthopedic outpatient physical therapy care and employing a PT/PTA team as assessors. Demonstrated reliability and validity by the PT/PTA team in clinical settings suggest AMS could help bridge gaps that exist between healthcare and fitness programs.
This study identifies areas for future improvement and development. Specifically, the reliability analysis identified three items (“rotate head down,” “step up and over,” and “rotate torso”) as having the lowest weighted kappa scores and agreement. These findings suggest need to further investigate administration protocols for these tasks, with potential revisions to either the instructions or the AMS protocol to enhance consistency and clarity. Additionally, the presence of a mild ceiling effect suggests that the current assessment may not fully capture the upper limits of participants’ fitness for participation in more vigorous physical activities. To address this, it is recommended that the AMS be supplemented with more challenging performance-based items, requiring greater strength and muscle power, be incorporated into the assessment to better differentiate high-performing individuals and extend the tool’s sensitivity. For example, grip strength may be useful measure to supplement the AMS Upper Body movement outcome, given the availability of normative data for individuals across the lifespan and association of low grip strength with sedentary work, cardiovascular disease, stroke, increased hospitalization and overall mortality [37,50].
Our findings are pivotal as they lay the groundwork for subsequent research endeavors. Further research should focus on advancing the AMS, addressing its current limitations due to the lack of normative reference values and a cut-off score for score interpretation. Our research team is dedicated to exploring the clinical applications of the AMS, including the integration of smartphone apps for video capture and kinematic analysis of joint angles to enhance observed movement ratings and provide immediate feedback.
Our findings are pivotal as they lay the groundwork for subsequent research endeavors. Further research should focus on advancing the AMS, addressing its current limitations due to the lack of normative reference values and a cut-off score for score interpretation. Our research team is dedicated to exploring the clinical applications of the AMS, including the integration of smartphone apps for video capture and kinematic analysis of joint angles to enhance observed movement ratings and provide immediate feedback.
Conclusion
This study provides evidence of the reliability and validity of AMS in adults with musculoskeletal disorders and potential gaps for future improvement and development.
Key Points
- The American Physical Therapy Association (APTA) has advocated leveraging knowledge and expertise in the movement system to optimize movement, promote health and wellness, mitigate the progression of impairments and prevent additional disability
- The AMS introduces an efficient and quantitative framework for movement screening that rated based on observed performance, extending beyond the mere identification of movement dysfunctions or symptom complaints
- The AMS presents a strategic opportunity for physical therapists, physical therapist assistants or other health practitioners to seamlessly integrate movement screening into wellness programs, empowering clinicians to prescribe appropriate physical activity by identifying significant movement deficits and enabling direction to self-administered exercises for individuals with minor deficits
Conflict of Interest
The authors declare that they have no conflict of interest in this paper.
Funding
None
Authors’ Contributions
All authors contributed equally in this paper.
References
- Althomali OW, Amin J, Alghamdi W, Shaik DH. Prevalence and factors associated with musculoskeletal disorders among secondary schoolteachers in hail, Saudi Arabia: a cross-sectional survey. Int J Environ Res Public Health. 2021;18(12):6632.
- Parto DN, Wong AY, Macedo L. Prevalence of musculoskeletal disorders and associated risk factors in canadian university students. BMC Musculoskelet Disord. 2023;24(1):501.
- Hendi OM, Abdulaziz AA, Althaqafi AM, Hindi AM, Khan SA, Atalla AA. Prevalence of musculoskeletal disorders and its correlation to physical activity among health specialty students. Int J Prev Med. 2019;10:48.
- Gagnon R, Perreault K, Guertin JR, Berthelot S, Achou B, Hébert LJ. Health-related quality of life of patients presenting to the emergency department with a musculoskeletal disorder. Clin Outcomes Res CEOR. 2022;14:91-103.
- Yelin EH, Cisternas M. The Burden of Musculoskeletal Diseases in the United States (BMUS). Fourth. United States Bone and Joint Initiative (USBJI); 2020.
- Abraham A, Sannasi R, Nair R. Normative values for the functional movement screen in adolescent school aged children. Int J Sports Phys Ther. 2015;10(1):29-36.
- Miyamori T, Nagao M, Shimasaki Y. Reliability assessment of the functional movement screen for predicting injury risk in Japanese college soccer players. J Phys Ther Sci. 2020;32(12):850-55.
- Sanchez-Lastra MA, Moldes JM, Diz JC, Martínez-Lemos RI, Ayán C. Feasibility and reliability of the Functional Movement Screen battery in adults with intellectual disability. J Intellect Disabil Res JIDR. 2022;66(6):568-75.
- Alkhathami K, Alshehre Y, Wang-Price S, Brizzolara K. Reliability and validity of the functional movement screenTM with a modified scoring system for young adults with low back pain. Int J Sports Phys Ther. 2021;16(3):620-7.
- Harper BA, Glass SM. Item-level and composite-level interrater reliability of functional movement screenTM scores following condensed training in novice raters. Int J Sports Phys Ther. 2021;16(4):1016-24.
- Smith CA, Chimera NJ, Wright NJ, Warren M. Interrater and intrarater reliability of the functional movement screen. J Strength Cond Res. 2013;27(4):982-7.
- Shultz R, Anderson SC, Matheson GO, Marcello B, Besier T. Test-retest and interrater reliability of the functional movement screen. J Athl Train. 2013;48(3):331-6.
- Onate JA, Dewey T, Kollock RO. Real-time intersession and interrater reliability of the functional movement screen. J Strength Cond Res. 2012;26(2):408-15.
- Gribble PA, Brigle J, Pietrosimone BG, Pfile KR, Webster KA. Intrarater reliability of the functional movement screen. J Strength Cond Res. 2013;27(4):978-81.
- Teyhen DS, Shaffer SW, Lorenson CL. The functional movement screen: a reliability study. J Orthop Sports Phys Ther. 2012;42(6):530-40.
- Morgan R, LeMire S, Knoll L. The functional movement screen: exploring interrater reliability between raters in the updated version. Int J Sports Phys Ther. 2023;V18(3):737-45.
- Leeder JE, Horsley IG, Herrington LC. The inter-rater reliability of the functional movement screen within an athletic population using untrained raters. J Strength Cond Res. 2016;30(9):2591-9.
- Minick KI, Kiesel KB, Burton L, Taylor A, Plisky P, Butler RJ. Interrater reliability of the functional movement screen. J Strength Cond Res. 2010;24(2):479-86.
- Aghi A, Salvagnini D, Berton G. Selective Functional Movement Assessment (SFMA) reliability and proposal of its use in sports. Int J Environ Res Public Health. 2023;20(3):2032.
- Stanek JM, Smith J, Petrie J. Intra- and inter-rater reliability of the Selective Functional Movement Assessment (SFMA) in healthy participants. Int J Sports Phys Ther. 2019;14(1):107-16.
- Glaws KR, Juneau CM, Becker LC, Di Stasi SL, Hewett TE. Intra- and inter-rater reliability of the selective functional movement assessment (sfma). Int J Sports Phys Ther. 2014;9(2):195-207.
- Dolbeer J, Mason J, Morris J, Crowell M, Goss D. Inter-rater reliability of the selective Functional Movement Assessment (SFMA) by SFMA certified physical therapists with similar clinical and rating experience. Int J Sports Phys Ther. 2017;12(5):752-63.
- Harper B, Aron A. Novice Inter-Rater Reliability on the Selective Functional Movement Assessment (SFMA) after a 4-hour training session. Int J Sports Phys Ther. 2023;18(4):940-8.
- Holm B, Jacobsen S, Skjodt H. Keitel functional test for patients with rheumatoid arthritis: translation, reliability, validity and responsiveness. Phys Ther. 2008;88(5):664-78.
- Kierkegaard M, Tegern M, Broman L, Halvarsson A, Larsson H. Test-retest reliability and translation of the musculoskeletal screening protocol questionnaire used in the swedish armed forces. Mil Med. 2022;188(7-8):2318-24.
- Davis JD, Orr R, Knapik JJ, Harris D. Functional Movement Screen (FMSTM) scores and demographics of US army pre-ranger candidates. Mil Med. 2020;185(5-6):e788-94.
- Zarei M, Soltanirad S, Kazemi A, Hoogenboom BJ, Hosseinzadeh M. Composite functional movement screen score predicts injuries in youth volleyball players: a prospective cohort study. Sci Rep. 2022;12(1):20207.
- Supples MW, Brichler KP, Glober NK, Lardaro TA, O’Donnell DP. Functional movement screen did not predict musculoskeletal injury among emergency medical services professionals. Work Read Mass. 2022;71(3):795-802.
- Moran RW, Schneiders AG, Mason J, Sullivan SJ. Do Functional Movement Screen (FMS) composite scores predict subsequent injury? A systematic review with meta-analysis. Br J Sports Med. 2017;51(23):1661-9.
- Trinidad-Fernandez M, Gonzalez-Sanchez M, Cuesta-Vargas AI. Is a low functional movement screen score (≤14/21) associated with injuries in sport? a systematic review and meta-analysis. BMJ Open Sport Exerc Med. 2019;5(1):e000501.
- Wickstrom R, Wang I. Reliability and concurrent validity of the Active Movement Scale (AMS) in adults with musculoskeletal disorders. Work. 2024.
- Mokkink LB, Terwee CB, Patrick DL. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual Life Res Int J Qual Life Asp Treat Care Rehabil. 2010;19(4):539-49.
- Fox GWC, Rodriguez S, Rivera-Reyes L, Loo G, Hazan A, Hwang U. PROMIS physical function 10-item short form for older adults in an emergency setting. J Gerontol A Biol Sci Med Sci. 2020;75(7):1418-23.
- US Department of Health and Human Services. Physical Activity Guidelines for Americans. 2nd Department of Health and Human Services. 2018.
- Piercy KL, Troiano RP, Ballard RM. The physical activity guidelines for Americans. JAMA. 2018;320(19):2020-8.
- Wickstrom RJ, Wang YC, Wickstrom NE, Smith RL, Dunning KK. A new two square agility test for workplace health-reliability, validity and minimal detectable change. J Phys Ther Sci. 2019;31(10):823-30.
- Matsuoka T, Fujimoto K, Kawahara M. Comparison of comfortable and maximum walking speed in the 10-meter walk test during the cerebrospinal fluid tap test in iNPH patients: A retrospective study. Clin Neurol Neurosurg. 2022;212:107049.
- Wang YC, Bohannon RW, Li X, Sindhu B, Kapellusch J. Hand-grip strength: normative reference values and equations for individuals 18 to 85 years of age residing in the United States. J Orthop Sports Phys Ther. 2018;48(9):685-93.
- Wang YC, Wickstrom R, Yen SC, Kapellusch J, Grogan KA. Assessing manual dexterity: comparing the workability rate of manipulation test with the minnesota manual dexterity test. J Hand Ther Off J Am Soc Hand Ther. 2018;31(3):339-47.
- Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15(2):155-63.
- Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33(1):159-74.
- Mukaka M. A guide to appropriate use of Correlation coefficient in medical research. Malawi Med J J Med Assoc Malawi. 2012;24(3):69-71.
- Thonon F, Godon-Rensonnet AS, Perozziello A, Garsi JP, Dab W, Emsalem P. Return on investment of workplace-based prevention interventions: a systematic review. Eur J Public Health. 2023;33(4):612-8.
- Keil NJ, Darby LA, Keylock T, Kiss J. Functional Movement ScreenTM in High School Basketball Players: Pre- and Post-Season. Int J Exerc Sci. 2022;15(6):1-14.
- Lisman P, Nadelen M, Hildebrand E, Leppert K, de la Motte S. Functional movement screen and Y-Balance test scores across levels of American football players. Biol Sport. 2018;35(3):253-60.
- Jafari M, Zolaktaf V, Ghasemi G. Functional movement screen composite scores in firefighters: effects of corrective exercise training. J Sport Rehabil. 2020;29(1):102-6.
- Shore E, Dally M, Brooks S, Ostendorf D, Newman M, Newman L. Functional movement screen as a predictor of occupational injury among denver firefighters. Saf Health Work. 2020;11(3):301-6.
- Miller JM, Susa KJ. Functional Movement Screen scores in a group of division IA athletes. J Sports Med Phys Fitness. 2019;59(5):779-83.
- Triplett CR, Dorrel BS, Symonds ML, Selland CA, Jensen DD, Poole CN. Functional Movement Screen Detected Asymmetry & Normative Values Among College-Aged Students. Int J Sports Phys Ther. 2021;16(2):450-8.
- Sikora D, Linek P. The relationship between the functional movement screen and the Y balance test in youth footballers. PeerJ. 2022;10:e13906.
Article Type
Research Article
Publication History
Accepted Date: 31-10-2024
Accepted Date: 18-11-2024
Published Date: 25-11-2024
Copyright© 2024 by Wang I, et al. All rights reserved. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation: Wang I, et al. Psychometric Properties of the 10-Item Active Movement Scale (AMS) for Assessing Musculoskeletal Function in Adults. J Ortho Sci Res. 2024;5(3):1-13.
Author (Year) | Study Sample | Measure / Sample Size | Number of raters | Time intervals between sessions | Results |
Sanchez-Lastra (2022) 8
| Adults from a residential and day care centre over 18 years of age, diagnosed with intellectual disability | FMS n = 30 | 3 raters | 14 days | Composite score: · Test-retest reliability (ICC) = 0.89 for trained assessor · Test-retest reliability (ICC): 0.60 to 0.76 for novice assessor · Inter-rater reliability (ICC): 0.71 to 0.80 |
Alkhathami (2021) 9 | Young adults between 18 and 40 years of age with low back pain (LBP) | FMS n = 22 | 4 raters | 6 weeks | Composite score: · Inter-rater reliability = 0.96 to 0.99 · Intra-rater reliability = 0.97 to 0.98 · Minimal Detectable Change (MDC95) = 0.90 to 1.05 |
Harper (2021) 10 | Healthy adults without restrictions to physical activity | FMS n = 16 | 5 raters | 4 days | Composite Score: · Inter-rater reliability (composite score) = ICC 0.79 to 0.84 Individual movement scores: · Inter-rater reliability (item-level) = Kappa -0.03 to 0.96 · Poor reliability for hurdle step, inline lunge and rotator stability. |
Smith (2013) 11 | Healthy, injury-free and physically active adults (aged 22 to 44) | FMS n = 20 | 4 raters | 1 week | Composite score: · Inter-rater reliability) (ICC) = 0.87 to 0.89 (good) Individual movement scores: · Inter-rater reliability (ICC) = 0.30 to 0.98, poor for hurdle step. · Intra-rater reliability (ICC): 0.81 to 0.91 |
Shultz (2013) 12 | National Collegiate Athletic Association Division IA varsity athletes | FMS n = 21 | 6 raters | 1 week | Composite score: · Test-retest reliability Live Session (ICC) = 0.60 (good) · Live-Versus-Video Session Reliability (ICC) = 0.92 (excellent) · Inter-rater reliability (Krippendorff α) = 0.38 (poor) Individual movement scores: · Interrater reliability: 0.25 to 0.95, mean 0.47 with poor reliability for in-line lunge, trunk stability, rotatory stability and overhead squat |
Onate (2012) 13 | Convenient healthy adults from the local University and surrounding area | FMS n = 19 | 2 raters | 1 week | Inter-session reliability (ICC) = 0.92 Inter-rater reliability (ICC) = 0.98
Inter-session agreement (kappa) = 0.16 to 1.00 (mean 0.55), with hurdle step poor Inter-rater agreement (kappa) = 0.31 to 1.00, mean 0.82, with hurdle step poor |
Gribble (2013) 14 | Convenient healthy adults from the university’s athletic training clinical staff and academic programs | FMS n = 3 | 38 raters (watch videos) | 1 week | Composite score: · Intra-rater reliability (all) (ICC) = 0.75 (moderate) · Intra-rater reliability (athletic training students) (ICC) = 0.37 · Intra-rater reliability (athletic trainer) (ICC) = 0.76 (moderate) · Intra-rater reliability (ATs with at least 6 months of experience) (ICC) = 0.95 (excellent) |
Teyhen (2012) 15 | Active-duty service members | FMS n = 64 | 2 raters | 48 to 72 hours | Composite score: · Test-retest reliability (ICC) = 0.76 (moderate) · Inter-rater reliability (ICC) = 0.74 (moderate) · Minimal detectable change (MDC95) = 2.1 to 2.5 points (on the 21-point scale) Individual movements: · Inter-rater agreement (kappa) = 0.45 to 0.82, mean 0.69 · Intra-rater agreement (kappa = 0.29 to 0.76, mean 0.61, rotary stability least reliable |
Morgan (2023) 16 | Healthy, active physical therapy students | FMS n = 45 | 4 raters (watch videos) | n/a | Composite Score: · Inter-rater reliability (ICC) = 0.95 · Individual movements: Inter-rater reliability (ICC) = 0.78 to 0.96 |
Minick (2010) 18 | Healthy college students | FMS n = 40 | 4 raters (watch videos) | n/a | Individual movement scores: · Inter-rater agreement (kappa) = 0.74 to 1.00, mean 0.92 |
Aghi (2023) 19 | Healthy young athletes | SFMA n = 28 | 2 raters (watch videos) | 60 days | Individual movement scores: · Intra-rater reliability (kappa) = 0.49 (moderate) · Inter-rater reliability (kappa) = 0.60 and 0.62 (substantial) |
Stanek (2019) 20 | Physically active participants | SFMA n = 25 | 3 raters | 2 to 7 days | Individual movement scores: · Intra-rater reliability (kappa) = 0.21 to 1.00 · Intra-rater reliability (% agreement) = 0.64 to 1.00 · Inter-rater reliability (kappa) = 0.11 to 0.89 · Inter-rater reliability (% agreement) = 0.52 to 0.96 |
Glaws (2014) 21 | Healthy students from physical therapy and the Ohio State University Club Rugby team | SFMA n = 35 | 3 raters (watch videos) | 7 to 14 days | Composite Score: · Intra‐rater reliability (ICC) = 0.26 to 0.52 · Inter‐rater reliability (ICC) = 0.43 Individual movement scores: · Intra‐rater reliability (kappa) = 0.72 to 0.83 · Intra‐rater reliability ((% agreement) = 0.85 to 0.91 · Inter‐rater reliability (kappa) = 0.41 to 0.69 · Inter‐rater reliability (% agreement) = 0.61 to 0.79 |
Dolbeer (2017) 22 | Subjects with a current history of axial, lower quarter or upper quarter pain of greater than two weeks | SFMA n = 49 | 3 raters (watch videos) | n/a | Composite Score: · Inter-rater reliability (ICC) = 0.61 Individual movement scores: · Inter-rater reliability (kappa) = 0.07 to 0.85 · Inter-rater reliability (% agreement) = 0.57 to 0.96 |
Harper (2023) 23 | Healthy volunteers | SFMA n = 25 | 2 raters | n/a | Composite Score: · Inter-rater reliability (ICC) = 0.55 to 0.78 Individual movement scores: · Inter-rater reliability (kappa) = 0.01 to 0.70 |
Holm (2008) 24 | Patients with rheumatoid arthritis (RA) | KFT n = 20 | 2 raters | 1 week | Composite Score: · Intra-rater reliability (ICC) = 0.90 to 0.95 · Inter-rater reliability (ICC) = 0.92 to 0.99 |
Kierkegaard (2023) 25 | Swedish military personnel | MSP n = 35 | n/a | 1 week | Individual movement scores: · Test-retest reliability (kappa) = 0.40 to 0.75 for items pertaining physical complaints/injuries · Test-retest reliability (kappa) = 0.72 to 1.00 for items pertaining physical performance, physical activity and exercise, eating and tobacco habits, sleep and perceived health · Test-retest reliability (kappa) = 0.47 to 0.65 for items for feeling mentally or physically prepared |
* ICC = Intraclass correlation coefficient; FMS = Functional Movement Screen; SFMA = Selective Functional Movement Assessment; KFT = Keitel Functional Test; MSP = Musculoskeletal Screening Protocol |
Table 1: Previous studies examining reliability of functional movement screen.
| Inter-rater reliability (rater 1 vs. 2) | |
| Weighted Kappa | Agree (%) |
Extend wrists | 0.75 | 89.1 |
Flex elbow back | 0.92 | 96.4 |
Elevate shoulders | 0.83 | 89.1 |
Rotate head down | 0.57 | 70.9 |
Extend head back | 0.73 | 85.5 |
Forward bend over | 0.89 | 94.5 |
Rotate torso | 0.64 | 76.4 |
Step up and over | 0.70 | 81.8 |
Active squat down | 0.91 | 94.5 |
Lunge backward | 0.84 | 83.6 |
Mean | 0.78 | 0.86 |
SD | 0.12 | 0.08 |
Table 2: Inter-rater reliability of the categorical scoring of AMS individual movement tasks presented as Kappa coefficient and agreement (%).
| Test-retest reliability (rater 2) | |
| Weighted Kappa | Agree |
Extend wrists | 0.67 | 87.3 |
Flex elbows back | 0.77 | 90.9 |
Elevate shoulders | 0.70 | 80.0 |
Rotate head down | 0.55 | 70.9 |
Extend head back | 0.77 | 89.1 |
Forward bend over | 0.75 | 87.3 |
Rotate torso | 0.54 | 70.9 |
Step up and over | 0.53 | 72.7 |
Deep squat down | 0.62 | 76.4 |
Lunge backward | 0.73 | 69.1 |
Mean | 0.66 | 0.80 |
SD | 0.10 | 0.09 |
Table 3: Test-retest reliability of the categorical scoring of AMS individual movement tasks presented as Kappa coefficient and agreement (%).
AMST | AMSU | AMSL | WtH | BMI | PFF | LPFS | CAL | TSAT | GSU | GSF | GS | GPP | WRMT | |
AMST | 1 |
|
| |||||||||||
AMSU | 0.80** | 1 |
| |||||||||||
AMSL | 0.88** | 0.43** | 1 | |||||||||||
WtH | -0.28* | -0.10 | -0.35** | 1 | ||||||||||
BMI | -0.29* | -0.05 | -0.40** | 0.87** | 1 | |||||||||
PFF | 0.61** | 0.40** | 0.61** | -0.12 | -0.19 | 1 | ||||||||
LPFS | 0.73** | 0.50** | 0.71** | -0.12 | -0.15 | 0.88** | 1 | |||||||
CAL | 0.40** | 0.29* | 0.38** | -0.11 | -0.08 | 0.55** | 0.56** | 1 | ||||||
TSAT | 0.42** | 0.26 | 0.43** | -0.24 | -0.25 | 0.37** | 0.42** | 0.18 | 1 | |||||
GSU | 0.43** | 0.25 | 0.45** | -0.16 | -0.28* | 0.32* | 0.32* | 0.28* | 0.50** | 1 | ||||
GSF | 0.47** | 0.25 | 0.52** | -0.18 | -0.24 | 0.34* | 0.41** | 0.30* | 0.52** | 0.81** | 1 | |||
GS | 0.18 | 0.20 | 0.11 | -0.05 | 0.06 | 0.12 | 0.27* | 0.25 | 0.19 | 0.01 | 0.12 | 1 | ||
GPP | 0.58** | 0.49** | 0.48** | -0.32* | -0.23 | 0.44** | 0.46** | 0.15 | 0.50** | 0.24 | 0.31* | 0.25 | 1 | |
WRMT | 0.46** | 0.42** | 0.37** | -0.27* | -0.23 | 0.32* | 0.29* | 0.19 | 0.37** | 0.29* | 0.31* | 0.17 | 0.74** | 1 |
WRMP | 0.60** | 0.51** | 0.51** | -0.29* | -0.28* | 0.47** | 0.52** | 0.18 | 0.64** | 0.37** | 0.49** | 0.20 | 0.79** | 0.71** |
AMS = Active Movement Scale; AMSU = Active Movement Scale Upper; AMSL = Active Movement Scale Lower; WtH = Waist-to-height ratio; BMI = Body mass index; PFF = PROMIS Physical Function 10a Survey (PFF); LPFS = Lifestyle Physical Function Screen; CAL = Cardio Activity Level; TSAT = Two Square Agility Test; GSU = 10-Meter Walk at Usual pace; GSF = 10-Meter Walk at Fast pace; GS = Grip Strength; GPP = Grooved Pegboard Placing test; WRMT = WorkAbility Rate of Manipulation Turning test; WRMP = WorkAbility Rate of Manipulation Placing test. **. Correlation is significant at the 0.01 level (2-tailed). *. Correlation is significant at the 0.05 level (2-tailed). |
Table 4: Concurrent validity of AMS percent scores with physical function measures.