JWST-TST DREAMS: NIRSpec/PRISM Transmission Spectroscopy of the Habitable Zone Planet TRAPPIST-1 e
Néstor Espinoza
Natalie H. Allen
Ana Glidden
Nikole K. Lewis
Sara Seager
Caleb I. Cañas
David Grant
Amélie Gressier
Shelby Courreges
Kevin B. Stevenson
Published 2025 September 8
© 2025. The Author(s). Published by the American Astronomical Society.
The Astrophysical Journal Letters, Volume 990, Number 2
Citation: Néstor Espinoza et al 2025 ApJL 990 L52
DOI: 10.3847/2041-8213/adf42e
Néstor Espinoza
AFFILIATIONS
Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
William H. Miller III Department of Physics and Astronomy, Johns Hopkins University, Baltimore, MD 21218, USA
Natalie H. Allen
AFFILIATIONS
William H. Miller III Department of Physics and Astronomy, Johns Hopkins University, Baltimore, MD 21218, USA
Ana Glidden
AFFILIATIONS
Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Kavli Institute for Astrophysics and Space Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Nikole K. Lewis
AFFILIATIONS
Department of Astronomy and Carl Sagan Institute, Cornell University, 122 Sciences Drive, Ithaca, NY 14853, USA
Sara Seager
AFFILIATIONS
Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Kavli Institute for Astrophysics and Space Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Department of Aeronautics and Astronautics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
Caleb I. Cañas
AFFILIATIONS
NASA Goddard Space Flight Center, Greenbelt, MD 20771, USA
David Grant
AFFILIATIONS
University of Bristol, HH Wills Physics Laboratory, Tyndall Avenue, Bristol, UK
Amélie Gressier
AFFILIATIONS
Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
Shelby Courreges
AFFILIATIONS
University of Texas at Austin, Department of Astronomy, 2515 Speedway C1400, Austin, TX 78712, USA
Kevin B. Stevenson
AFFILIATIONS
Johns Hopkins APL, 11100 Johns Hopkins Road, Laurel, MD 20723, USA
Sukrit Ranjan
AFFILIATIONS
University of Arizona, Lunar and Planetary Laboratory/Department of Planetary Sciences, Tucson, AZ 85721, USA
Knicole Colón
AFFILIATIONS
NASA Goddard Space Flight Center, Greenbelt, MD 20771, USA
Brett M. Morris
AFFILIATIONS
Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
Ryan J. MacDonald
AFFILIATIONS
Department of Astronomy, University of Michigan, 1085 South University Avenue, Ann Arbor, MI 48109, USA
Author notes
NHFP Sagan Fellow.
Douglas Long
AFFILIATIONS
Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
Hannah R. Wakeford
AFFILIATIONS
University of Bristol, HH Wills Physics Laboratory, Tyndall Avenue, Bristol, UK
Jeff A. Valenti
AFFILIATIONS
Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
Lili Alderson
AFFILIATIONS
Department of Astronomy, Cornell University, 122 Sciences Drive, Ithaca, NY 14853, USA
Natasha E. Batalha
AFFILIATIONS
NASA Ames Research Center, MS 245-3, Moffett Field, CA 94035, USA
Ryan C. Challener
AFFILIATIONS
Department of Astronomy and Carl Sagan Institute, Cornell University, 122 Sciences Drive, Ithaca, NY 14853, USA
Jingcheng Huang
AFFILIATIONS
Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Zifan Lin
AFFILIATIONS
Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Dana R. Louie
AFFILIATIONS
Catholic University of America, Department of Physics, Washington, DC 20064, USA
Exoplanets and Stellar Astrophysics Laboratory (Code 667), NASA Goddard Space Flight Center, Greenbelt, MD 20771, USA
Center for Research and Exploration in Space Science and Technology II, NASA/GSFC, Greenbelt, MD 20771, USA
Elijah Mullens
AFFILIATIONS
Department of Astronomy and Carl Sagan Institute, Cornell University, 122 Sciences Drive, Ithaca, NY 14853, USA
Daniel Valentine
AFFILIATIONS
University of Bristol, HH Wills Physics Laboratory, Tyndall Avenue, Bristol, UK
C. Matt Mountain
AFFILIATIONS
Association of Universities for Research in Astronomy, 1331 Pennsylvania Avenue NW Suite 1475, Washington, DC 20004, USA
Laurent Pueyo
AFFILIATIONS
Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
Marshall D. Perrin
AFFILIATIONS
Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
Andrea Bellini
AFFILIATIONS
Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
Jens Kammerer
AFFILIATIONS
European Southern Observatory, Karl-Schwarzschild-Straße 2, 85748 Garching, Germany
Mattia Libralato
AFFILIATIONS
INAF-Osservatorio Astronomico di Padova, Via dell’Osservatorio 5, 35122 Padova, Italy
Isabel Rebollido
AFFILIATIONS
European Space Agency (ESA), European Space Astronomy Centre (ESAC), Camino Bajo del Castillo s/n, 28692 Villanueva de la Cañada, Madrid, Spain
Emily Rickman
AFFILIATIONS
European Space Agency (ESA), ESA Office, Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
Sangmo Tony Sohn
AFFILIATIONS
Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
Department of Astronomy & Space Science, Kyung Hee University, 1732 Deogyeong-daero, Yongin-si, Gyeonggi-do 17104, Republic of Korea
Roeland P. van der Marel
AFFILIATIONS
Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA
William H. Miller III Department of Physics and Astronomy, Johns Hopkins University, Baltimore, MD 21218, USA
Received 2025 May 21; revised 2025 July 21; accepted 2025 July 24; published 2025 September 8
Unified Astronomy Thesaurus concepts: Exoplanet atmospheres; Exoplanet astronomy; Exoplanets; James Webb Space Telescope; Extrasolar rocky planets; Habitable planets; Habitable zone
2041-8205/990/2/L52
Abstract
TRAPPIST-1 e is one of the very few rocky exoplanets that is both amenable to atmospheric characterization and resides in the habitable zone of its star, that is, at a distance from its star such that it might, with the right atmosphere, sustain liquid water on its surface. Here, we present a set of four JWST/NIRSpec PRISM transmission spectra of TRAPPIST-1 e obtained in mid-to-late 2023. Our transmission spectra exhibit similar levels of stellar contamination as observed in prior works for other planets in the TRAPPIST-1 system but over a wider wavelength range, showcasing the challenge of characterizing the TRAPPIST-1 planets even at relatively long wavelengths (3–5 μm). While we show that current stellar modeling frameworks are unable to explain the stellar contamination features in our spectra, we demonstrate that we can marginalize over those features instead using Gaussian processes, which enables us to perform novel exoplanet atmospheric inferences with our transmission spectra. In particular, we are able to rule out cloudy, primary H₂-dominated (≳80% by volume) atmospheres at better than a 3σ level. Constraints on possible secondary atmospheres on TRAPPIST-1 e are presented in a companion paper. Our work showcases how JWST is breaking ground in the precision needed to constrain the atmospheric composition of habitable-zone rocky exoplanets.
Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
1. Introduction
In over 2 yr of scientific operations, JWST (J. P. Gardner et al. 2023) has revolutionized the field of exoplanet atmospheres. Its unprecedented spectrophotometric precision and wavelength coverage allow it to routinely detect water, carbon dioxide, and even products of photochemistry in gas giant exoplanet atmospheres (see, e.g., A. L. Carter et al. 2024 and references therein); detect rich inventories of carbon-bearing species on sub-Neptunes for the first time (see, e.g., N. Madhusudhan et al. 2023; T. G. Beatty et al. 2024; B. Benneke et al. 2024; M. Holmberg & N. Madhusudhan 2024); and, very recently, even constrain the atmospheric composition (or lack thereof) of rocky exoplanets (e.g., T. P. Greene et al. 2023; O. Lim et al. 2023; J. Lustig-Yaeger et al. 2023; E. M. May et al. 2023; S. E. Moran et al. 2023; S. Zieba et al. 2023; L. Alderson et al. 2024; A. Gressier et al. 2024; R. Hu et al. 2024; J. Kirk et al. 2024; J. A. Patel et al. 2024; N. Scarsdale et al. 2024; Q. Xue et al. 2024; M. Zhang et al. 2024; M. K. Alam et al. 2025; P. C. August et al. 2025; M. Radica et al. 2025; P. Wachiraphan et al. 2025).
Among the rocky planetary systems amenable to atmospheric characterization with JWST, the system of seven Earth-sized planets orbiting TRAPPIST-1 arguably stands as one of the most exciting for performing such detailed studies (M. Gillon et al. 2017). The system provides unique avenues for characterization, as the M dwarf the planets orbit is small, which maximizes the observability of the signals in both emission and transmission spectroscopy (see, e.g., the discussion in R. Doyon 2024). Perhaps most interestingly, however, the seven planets span a wide range of instellations (the amount of energy incident on a planetary body from its host star), with the inner planets TRAPPIST-1 b, c, and d having 4×, 2×, and 1.1× Earth’s instellation; TRAPPIST-1 e, f, and g spanning the system’s habitable zone (HZ), the distance at which exoplanets with the right atmospheres might sustain liquid water on their surfaces (J. F. Kasting et al. 1993; R. K. Kopparapu et al. 2013); and TRAPPIST-1 h having only 10% of Earth’s instellation. While it is heavily debated in the literature whether these exoplanets should have retained atmospheres at all given the large expected cumulative X-ray and ultraviolet irradiation produced by their active M dwarf host (see, e.g., C. Garraffo et al. 2017; P. J. Wheatley et al. 2017; M. Turbet et al. 2020; J. Krissansen-Totton 2023; G. Van Looveren et al. 2024 and references therein), the system’s wide range of instellations provides a unique laboratory to test these predictions, ultimately offering an opportunity to test whether the solar system’s cosmic shoreline, which would predict atmospheres to be more likely for the rocky exoplanets farther away from the star and with larger escape velocities, also holds for planets orbiting stars elsewhere (K. J. Zahnle & D. C. Catling 2017).
While pioneering observations with the Hubble Space Telescope (HST) WFC3 have provided some constraints on possible atmospheric compositions for all the planets in the system (see, e.g., J. de Wit et al. 2016, 2018; Z. Zhang et al. 2018; L. J. Garcia et al. 2022; A. Gressier et al. 2022), recent JWST explorations through different instrument modes have provided even tighter constraints thanks to the improved precision and wavelength coverage of the observatory. Initial secondary eclipse observations with JWST/MIRI 15 μm photometry (T. P. Greene et al. 2023; S. Zieba et al. 2023) and 0.6−2.8 μm transmission spectroscopy with NIRISS/SOSS (O. Lim et al. 2023; M. Radica et al. 2025) have both revealed that sizable atmospheres for the TRAPPIST-1 b and c planets are unlikely. While these results put few constraints on the kinds of atmospheres that the exoplanets farther away from their host star have (J. Krissansen-Totton 2023; M. T. Gialluca et al. 2024), the NIRISS/SOSS results highlight how stellar contamination, the distortion of the observed transmission spectrum due to stellar surface heterogeneities (see, e.g., B. V. Rackham et al. 2018 and references therein), largely shapes the observed transmission spectra, at least at wavelengths <3 μm, showcasing it as the biggest challenge when it comes to studying the exoplanets in the system. Whether this holds true for wavelengths longer than 3 μm, where some predictions suggest stellar contamination could be less prominent (see, e.g., B. V. Rackham et al. 2018), has not been studied empirically to date.
Here we present a set of four transmission spectra for TRAPPIST-1 e, one of the exoplanets in the HZ of the TRAPPIST-1 system (R_p = 0.92 R⊕, M_p = 0.69 M⊕, P = 6.10 days, T_eq = 250 K), obtained with JWST/NIRSpec PRISM in the 0.6−5 μm range. This work is part of a series of studies being pursued by the JWST Telescope Scientist Team (JWST-TST), which uses Guaranteed Time Observations (GTO) time awarded by NASA in 2003 (PI: M. Mountain) for studies in three different subject areas: (a) transiting exoplanet spectroscopy (lead: N. Lewis), (b) exoplanet and debris disk high-contrast imaging (lead: M. Perrin), and (c) Local Group proper-motion science (lead: R. van der Marel). A common theme of these investigations is the desire to pursue and demonstrate science for the astronomical community at the limits of what is made possible by the exquisite optics and stability of JWST. The present Letter is part of our work on transiting exoplanet spectroscopy, which focuses on Deep Reconnaissance of Exoplanet Atmospheres using Multi-instrument Spectroscopy (DREAMS; see, e.g., D. Grant et al. 2023a; A. Gressier et al. 2025; D. Valentine et al. 2024; D. R. Louie et al. 2025) of three transiting exoplanets representative of key classes: hot Jupiters (WASP-17 b; GTO 1353), warm Neptunes (HAT-P-26 b; GTO 1312), and temperate terrestrials (TRAPPIST-1 e; GTO 1331).
Our work is divided into two manuscripts. The current Letter presents the JWST NIRSpec/PRISM data reduction and analysis, as well as the necessary methodologies to interpret our transmission spectra, which, as we show throughout this work, we believe are dominated by stellar contamination. A companion paper (A. Glidden et al. 2025) provides a deep dive on the implications of our atmospheric constraints for possible secondary atmospheres on TRAPPIST-1 e. The current Letter is structured as follows. In Section 2, we introduce our observations and data reduction framework. In Section 3, we present our results from modeling both the white-light and spectrophotometric light curves, as well as the retrieval framework used in our work to extract inferences from possible planetary atmospheres on TRAPPIST-1 e, which introduces a novel Gaussian process (GP) framework to deal with stellar contamination. The section that follows offers a discussion of our results, contextualizing them with previous constraints and observations, and our final conclusions are presented in the closing section.
2. Observations and Data Reduction
2.1. Observations
Observations targeting a primary transit of TRAPPIST-1 e with JWST/NIRSpec PRISM were obtained by GTO 1331 (N. Lewis et al. 2017) on 2023 June 22, June 28, July 23, and October 28. Target acquisition was performed on each visit on TRAPPIST-1 itself. These four transit observations consisted of 4 hr exposures with five groups per integration using the SUB512 subarray, which gave a cadence of 1.38 s per integration and allowed plenty of (a) time baseline to observe the ∼1 hr transit events and (b) nonilluminated detector space to subtract possible background signals and other detector systematics. While this setup allowed us to obtain spectrophotometry in a wide wavelength range (0.6–5 μm), it did make some pixels in the 1.1−1.7 μm region reach fluence levels above 90% of the saturation level of the detector in the last group, which, as has been discussed in the literature, requires special care (Z. Rustamkulov et al. 2023; A. L. Carter et al. 2024).
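As a rough check on the observing setup described above, the following sketch counts how many integrations fit in one visit and how much of the visit is out-of-transit baseline. It uses only the numbers quoted in the text (4 hr exposures, 1.38 s cadence, a transit of about 1 hr); the function and variable names are illustrative, and overheads between integrations are ignored.

```python
# Rough bookkeeping for one visit, using only numbers quoted in the text.
EXPOSURE_HOURS = 4.0   # length of each visit's exposure
CADENCE_S = 1.38       # seconds per integration (SUB512, five groups)
TRANSIT_HOURS = 1.0    # approximate transit duration


def n_integrations(duration_hours: float, cadence_s: float) -> int:
    """Number of whole integrations that fit in a given duration."""
    return int(duration_hours * 3600.0 / cadence_s)


total = n_integrations(EXPOSURE_HOURS, CADENCE_S)
in_transit = n_integrations(TRANSIT_HOURS, CADENCE_S)

# Fraction of the visit spent outside the transit (the "time baseline").
baseline_fraction = 1.0 - in_transit / total

print(total, in_transit, round(baseline_fraction, 2))
```

Under these assumptions roughly three quarters of each visit is baseline, which is what gives the systematics model enough out-of-transit data to work with.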
2.2. Data Reduction
The data were reduced with a variety of pipelines to validate the robustness of the signals observed in our transmission spectra; the details of this comparison and of each data reduction pipeline are given in Appendix A. In what follows, we make use of the results obtained by using the transitspectroscopy pipeline (N. Espinoza 2022), which are detailed in Appendix A.1. In short, this mainly makes use of the JWST Calibration Pipeline’s Stage 1 to perform detector calibration of *uncal.fits files all the way to the rates per integration (H. Bushouse et al. 2022); the only different step is a custom jump detection algorithm. The 1/f noise reduction is performed at the rates per integration level using a methodology similar to the one introduced in M. Radica et al. (2023) for NIRISS/SOSS. The spectral traces of each integration are obtained via cross-correlation at each column with a Gaussian, whose peaks are fitted by a B-spline to smooth the trace at each integration. Spectral extraction is performed via simple extraction.
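The tracing and extraction steps described above can be sketched on a synthetic frame. This is a minimal illustration and not the transitspectroscopy implementation: the function names, the Gaussian width, and the box half-width are assumptions, the frame is simulated, and a low-order polynomial stands in for the B-spline smoothing mentioned in the text so that only NumPy is required.

```python
import numpy as np


def trace_centers(frame, sigma_pix=1.5):
    """Row of maximum cross-correlation with a Gaussian, for every column."""
    half = int(4 * sigma_pix)
    offsets = np.arange(-half, half + 1)
    kernel = np.exp(-0.5 * (offsets / sigma_pix) ** 2)
    centers = np.empty(frame.shape[1])
    for col in range(frame.shape[1]):
        # With a symmetric, odd-length kernel, mode="same" keeps the
        # cross-correlation peak aligned with the profile centre.
        ccf = np.correlate(frame[:, col], kernel, mode="same")
        centers[col] = np.argmax(ccf)
    return centers


def smooth_trace(centers, degree=2):
    """Smooth the per-column peaks (polynomial stand-in for a B-spline fit)."""
    cols = np.arange(centers.size)
    return np.polyval(np.polyfit(cols, centers, degree), cols)


def simple_extraction(frame, trace, half_width=4):
    """Sum the counts in a fixed-width box centred on the trace."""
    rows = np.arange(frame.shape[0])[:, None]
    in_box = np.abs(rows - trace[None, :]) <= half_width
    return np.sum(frame * in_box, axis=0)


# Synthetic 32 x 512 frame with a slightly tilted Gaussian spectral profile.
n_rows, n_cols = 32, 512
true_trace = 14.0 + 0.004 * np.arange(n_cols)
rows = np.arange(n_rows)[:, None]
frame = np.exp(-0.5 * ((rows - true_trace[None, :]) / 1.2) ** 2)

trace = smooth_trace(trace_centers(frame))
spectrum = simple_extraction(frame, trace)
```

The per-column peaks are integers, so the smoothing step is what recovers a sub-pixel trace; on real data it also suppresses column-to-column scatter caused by noise.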
Light curves are then fitted with the juliet library (N. Espinoza et al. 2019). For the band-integrated, white-light light curves, we let the limb-darkening coefficients be free parameters using a quadratic limb-darkening law, where we use the D. M. Kipping (2013) parameterization for limb darkening. While, as noted in L.-P. Coulombe et al. (2024), this parameterization might in general introduce biases for JWST-quality light curves, we believe this is not important in our case, as the band-integrated flux for NIRSpec/PRISM is dominated by flux at <2 μm, where this same work shows that this effect is minimal. However, for ease of comparison across data reduction pipelines, we decided to fix the limb-darkening coefficients in the wavelength-dependent light-curve fits to the ones predicted by PHOENIX models using the limb-darkening library (N. Espinoza & A. Jordán 2015). A GP (S. Aigrain & D. Foreman-Mackey 2023) is used to model systematic trends in the observed light curves. The band-integrated and wavelength-dependent light curves were fitted with three possible kernels: an exponential, a Matérn 3/2, and an exponential Matérn 3/2 kernel (i.e., a multiplication of both). We found identical results with the three; here we show the results for the Matérn 3/2 kernel to parameterize the GP using the celerite library (D. Foreman-Mackey et al. 2017). The use of GPs produces, overall, slightly larger error bars on the estimated transmission spectrum for the four visits, which is one of the reasons why we decided to perform our inferences with this pipeline, as it gives rise to overall more conservative error bars on the final transmission spectrum (although, as described in Appendix A, with variations seen as a function of wavelength consistent across reductions for the four visits).
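The three kernels named above can be written down in a few lines. The sketch below uses the textbook closed forms of the exponential and Matérn 3/2 covariance functions and their product; it is not the celerite code path (celerite evaluates its own approximation to the Matérn 3/2 kernel), and the function names and example hyperparameters are assumptions chosen to resemble the amplitudes (a few hundred ppm) and timescales (5–10 minutes) discussed in this Letter.

```python
import math


def matern32(tau, sigma, rho):
    """Matern 3/2 covariance for a time lag tau (same units as rho)."""
    x = math.sqrt(3.0) * abs(tau) / rho
    return sigma ** 2 * (1.0 + x) * math.exp(-x)


def exponential(tau, sigma, rho):
    """Exponential covariance for a time lag tau."""
    return sigma ** 2 * math.exp(-abs(tau) / rho)


def exp_matern32(tau, sigma, rho_exp, rho_m32):
    """Product of both kernels; the amplitude is applied only once."""
    return exponential(tau, 1.0, rho_exp) * matern32(tau, sigma, rho_m32)


def covariance_matrix(times, kernel, jitter=0.0):
    """Dense covariance matrix with white-noise jitter added on the diagonal."""
    n = len(times)
    return [[kernel(times[i] - times[j]) + (jitter ** 2 if i == j else 0.0)
             for j in range(n)] for i in range(n)]


# Illustrative hyperparameters: 300 ppm amplitude, 7.5 minute timescale.
SIGMA_PPM = 300.0
RHO_HOURS = 7.5 / 60.0
times_hours = [i * 1.38 / 3600.0 for i in range(5)]
cov = covariance_matrix(
    times_hours, lambda tau: matern32(tau, SIGMA_PPM, RHO_HOURS), jitter=100.0
)
```

A dense matrix like this scales poorly with the thousands of integrations in a visit, which is the practical reason for relying on a fast solver such as celerite in a real fit.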
3. Results
3.1. White-light Light-curve Analysis
Figure 1 showcases a close-up of the observed transit events of TRAPPIST-1 e on the white-light light curves for each of our observation dates using the data reduction procedure described above as gray points in order to showcase the overall data quality of our observations. The priors and posteriors used for each individual light-curve fit (i.e., each visit) are presented in Table 1, and the best-fit light curves are presented in black along with the corresponding residuals in Figure 1.
Figure 1. White-light TRAPPIST-1 e JWST/NIRSpec PRISM transit light curves. (Top) Data points of the transit event (gray; binned at a cadence of 14 s) along with the best-fit transit plus systematics model (black; which includes a visit-long slope and a GP; see text for details). The date at which each observation was obtained is indicated at the top of each panel. (Middle) Residuals of the data minus the best-fit light-curve model in parts per million; the rms of the data at this 14 s cadence is indicated for each observation. (Bottom) Light curves at the Hα line (0.656 ± 0.02 μm, in parts per thousand) at the same cadence as the white-light curves (small light blue) and binned at 5 minutes (large dark blue); note how a flare is revealed at about −0.25 hr in this light curve as the most likely explanation for the bump in the 2023 July 23 transit event.
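The binning and rms statistics quoted in the Figure 1 caption can be illustrated with a short sketch. It assumes that a 14 s bin corresponds to about 10 integrations at the 1.38 s cadence, and it uses simulated white-noise residuals with a hypothetical scatter of 1000 ppm per integration; none of the numbers below are measurements from the data.

```python
import math
import random


def bin_series(values, n_per_bin):
    """Average consecutive groups of n_per_bin points (drops the remainder)."""
    n_bins = len(values) // n_per_bin
    return [sum(values[i * n_per_bin:(i + 1) * n_per_bin]) / n_per_bin
            for i in range(n_bins)]


def rms_ppm(residuals):
    """Root-mean-square of relative-flux residuals, in parts per million."""
    return 1e6 * math.sqrt(sum(r * r for r in residuals) / len(residuals))


CADENCE_S = 1.38                      # native cadence quoted in the text
N_PER_BIN = round(14.0 / CADENCE_S)   # 10 integrations, i.e. about 13.8 s bins

random.seed(1)
# Hypothetical white-noise residuals (1000 ppm per integration), not real data.
residuals = [random.gauss(0.0, 1000e-6) for _ in range(10000)]
binned = bin_series(residuals, N_PER_BIN)

raw_rms = rms_ppm(residuals)
binned_rms = rms_ppm(binned)
```

For purely white noise the binned rms falls as one over the square root of the number of points per bin; correlated noise bins down more slowly than that.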
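Table 1. Prior and Posterior Parameters of the White-light (i.e., Band-integrated) Light-curve Fits Performed on the NIRSpec/PRISM Data of TRAPPIST-1 e
Columns: Parameter; Prior; Jun 22; Jun 28; Jul 23; Oct 28; Combined
Physical and Orbital Parameters
Period offset from P (s): prior N(0, 3²)
Time-of-transit offset from T_pred (s): prior N(0, 8640²)
Transit depth (ppm): prior U(0.0, 0.2) on the planet-to-star radius ratio
Impact parameter b: prior N(0.191, 0.041²)
Scaled semimajor axis a/R*: prior N(52.86, 0.39²)
Eccentricity: fixed
Argument of periastron (deg): fixed; 90 for Jun 22, Jun 28, Jul 23, Oct 28, and Combined
Limb-darkening Coefficients
u1: prior U(0, 1) on q1
u2: prior U(0, 1) on q2
Instrument Systematics
Flux normalization offset (ppm): normal prior centered on 0
GP amplitude σ_GP (ppm)
GP timescale ρ_GP (hours)
Jitter σ_w (ppm)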
Notes. For the priors, N(μ, σ²) stands for a normal distribution with mean μ and variance σ²; U(a, b) stands for a uniform distribution between a and b, respectively; and log U(a, b) stands for a log-uniform prior on the same range. Priors for times of transit T_pred, period P, impact parameter b, and semimajor axis to stellar radius ratio a/R* come from E. Agol et al. (2021). Priors are for the individual light-curve fits of each visit. Combined posterior (last column) corresponds to the average of the four independent fits.
Here, P = 6.101013 days is the best-fit period in E. Agol et al. (2021).
T_pred are the predicted transit times for our visits in E. Agol et al. (2021), which are 2460118.460787, 2460124.55984, 2460148.956812, and 2460246.538176 for 2023 June 22, June 28, July 23, and October 28, respectively. All times in BJD TDB.
Quadratic limb-darkening law; priors were set on the q1 and q2 parameters using the transformations in D. M. Kipping (2013) to obtain u1 and u2.
The transit model is divided by one plus the flux normalization offset to account for possible normalization offsets made to the light curves when obtaining relative fluxes. For details, see Section 2.1 in N. Espinoza et al. (2019).
Here, σ_GP and ρ_GP represent the amplitude and timescale of a Matérn 3/2 GP; σ_w represents jitter added in quadrature to error bars.
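The limb-darkening note of Table 1 refers to the D. M. Kipping (2013) transformations between the sampled parameters and the quadratic-law coefficients. A minimal sketch of those transformations is given below; the function names are illustrative and this is not the juliet code itself.

```python
import math


def q_to_u(q1, q2):
    """Map the sampled (q1, q2) in [0, 1] to quadratic coefficients (u1, u2)."""
    sqrt_q1 = math.sqrt(q1)
    return 2.0 * sqrt_q1 * q2, sqrt_q1 * (1.0 - 2.0 * q2)


def u_to_q(u1, u2):
    """Inverse mapping; undefined when u1 + u2 = 0 (no limb darkening)."""
    total = u1 + u2
    return total ** 2, u1 / (2.0 * total)


def quadratic_intensity(mu, u1, u2):
    """Quadratic limb-darkening law, normalised to 1 at disc centre (mu = 1)."""
    return 1.0 - u1 * (1.0 - mu) - u2 * (1.0 - mu) ** 2


u1, u2 = q_to_u(0.25, 0.5)
q1_back, q2_back = u_to_q(u1, u2)
```

Sampling q1 and q2 uniformly between 0 and 1 keeps the resulting intensity profile positive and monotonically decreasing toward the limb, which is the motivation for placing the U(0, 1) priors on them rather than on u1 and u2 directly.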
Overall, the white-light curves showcase very high precision, albeit at higher noise levels than what would be expected from the error bars estimated by the JWST pipeline (as is true in most JWST white-light transit light curves, very likely stemming from residual 1/f noise; see, e.g., N. Espinoza et al. 2023; A. L. Carter et al. 2024). However, it is evident that different epochs present different levels of systematic trends. The 2023 July 23 visit, for instance, presents an evident bump just before midtransit, which, following W. S. Howard et al. (2023) and M. Radica et al. (2023), we identify as a small flare, judging from the shape of the light curve at Hα wavelengths (Figure 1, bottom panel); similar small oscillations are observed in the 2023 June 22 light curve, with the rest of the light curves not showcasing as strong systematic effects. Indeed, this is reflected in the best-fit amplitudes of the GPs (σ_GP parameter) for each visit presented in Table 1: the amplitude is tightly constrained to be about a few hundred ppm for both of those visits, whereas the parameter is loosely constrained for the 2023 June 28 visit (where a lower level of systematic effects is observed) and appears to be slightly smaller for the 2023 October 28 visit. Interestingly, the timescales of the process all appear to be relatively similar, on the order of 5–10 minutes (except, again, for the 2023 June 28 visit, where the parameter is also loosely constrained). Investigations on the time series of the parameters of the 2D spectra, such as trace motion and profile widths, reveal no obvious correlation with the variability observed in our light curves. Similar studies performed on the JWST guide star data using the spelunker library (D. Deal & N. Espinoza 2024) provided similar null correlation results. This suggests that the systematic trends observed in our light curves might actually be due to time-varying phenomena produced by TRAPPIST-1 (the star) itself, such as flares or some type of stellar oscillation. While studying the detailed origin of these is outside the scope of this work, we do account for them in our wavelength-dependent light-curve fitting via GPs. We did find, however, that other methodologies give rise to similar results (see Appendix A for details on this comparison).
Another interesting element of the individual visit results is the larger white-light transit depth on the first visit of the program on 2023 June 22. This difference is particularly curious given that the depth difference is the largest when compared against the 2023 June 28 visit, which happened only 6 days after the June 22 visit (or about two stellar rotation periods, considering a rotation period of 3.3 days as measured in B. M. Morris et al. 2018). The transit depth difference between those two visits is significant at more than 5σ and can also be seen by eye in Figure 1. As will be shown in the next section, this variability is observed at the same absolute level in all of our data reduction methodologies, which strongly suggests that this is real variability in the transit depths, very likely stemming from stellar contamination (B. V. Rackham et al. 2018).
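The two pieces of arithmetic behind this comparison, the number of stellar rotations between the visits and the significance of a depth difference, can be sketched as follows. The 6 day separation and the 3.3 day rotation period come from the text; the depths and uncertainties passed to the function are hypothetical placeholders (not the values in Table 1), and the formula assumes the two measurements are independent with Gaussian errors.

```python
import math


def difference_significance(depth_a, err_a, depth_b, err_b):
    """Difference of two independent measurements in units of its 1-sigma error."""
    return (depth_a - depth_b) / math.hypot(err_a, err_b)


# Quantities quoted in the text.
DAYS_BETWEEN_VISITS = 6.0
ROTATION_PERIOD_DAYS = 3.3
rotations_elapsed = DAYS_BETWEEN_VISITS / ROTATION_PERIOD_DAYS

# Hypothetical depths and errors in ppm, chosen only to exercise the function.
z = difference_significance(5000.0, 30.0, 4800.0, 25.0)
```

Because the visits are separated by a non-integer number of rotations, the planet crosses a stellar disc whose visible hemisphere has changed, which is one way a stellar origin for the depth change can arise.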
Despite the above effects, our white-light-curve results in Table 1 already show a significant improvement in the orbital parameters of TRAPPIST-1 e by a factor of ∼3 in both the impact parameter and the scaled semimajor axis and by a factor of ∼50 in the predicted time of transits from E. Agol et al. (2021). As has been shown in E. Agol et al. (2024), the latter are of particular importance for improving the overall ephemerides for the planetary system around TRAPPIST-1. This showcases once again that JWST white-light curves, although typically by-products of atmospheric characterization, provide excellent data sets to refine planetary system properties at levels that were unattainable by previous instrumentation (see, e.g., A. L. Carter et al. 2024; A. S. Mahajan et al. 2024 and references therein).
3.2. The Transmission Spectra
The transmission spectra obtained for our different observing dates are presented in Figure 2. These were obtained by following the same procedures described above for the white-light light-curve analysis but fixing the orbital parameters (i.e., the time-of-transit center, impact parameter, scaled semimajor axis, eccentricity, and argument of periastron) as well as the limb-darkening coefficients using a quadratic law, as described in Section 2.2. As can be observed, the data showcase an overall flatter transmission spectrum from 0.6 to 5 μm on the 2023 June visits and strong increases in transit depth toward longer wavelengths for the visits of 2023 July and October. As we show in Appendix A, these variations in the transmission spectra for different epochs are observed in all of our reductions, which use different pipelines to reduce the raw data and different methodologies to fit the transit light curves themselves, which showcases the robustness of the observed features in the transmission spectra.
Figure 2. TRAPPIST-1 e NIRSpec/PRISM transmission spectra on different epochs. The transmission spectra are ordered in chronological order from top to bottom, with the dates on which they were obtained indicated in the lower right of each panel. The y-axes for all plots cover the same ranges, which highlights how much the transmission spectrum varies across epochs, in particular for the transmission spectra obtained in 2023 July and October.
Given that TRAPPIST-1 is well known as a magnetically active star, shown to frequently flare and host both hot and cold spots on the stellar surface through evidence in photometric monitoring campaigns and transmission spectra (see, e.g., B. M. Morris et al. 2018; H. R. Wakeford et al. 2019; E. Ducrot et al. 2020; W. S. Howard et al. 2023; O. Lim et al. 2023; M. Radica et al. 2025), we interpret those epoch-to-epoch variations as a function of wavelength in our observed transmission spectra as mostly being the product of stellar heterogeneities contaminating our transmission spectra, which are evolving in time.
Initial attempts at using existing retrieval analysis tools that include stellar contamination modeling to explain our observed transmission spectra were unsuccessful at reproducing their complex observed variation as a function of wavelength. As we show in the Appendix, while publicly available stellar models were adequate to model the visits in 2023 June, from which we infer the possible existence of heterogeneities colder than the stellar photosphere (i.e., “cold” spots), they are unsuccessful at modeling the visits on 2023 July 23 and 2023 October 28, both of which showcase strong evidence for hot stellar heterogeneities (i.e., “hot” spots). Those experiments suggest that, although our data present strong evidence for stellar contamination in the transmission spectrum of TRAPPIST-1 e, performing inferences on them is not straightforward with current stellar models and/or retrieval frameworks. It is very likely that this is due to the limitations of using 1D radiative/convective equilibrium stellar atmosphere models with different temperatures as a proxy for the spectral features of cold and hot “spots,” which are known to arise in magnetically active regions and thus give rise to very complex emergent fluxes (see, e.g., V. Witzke et al. 2022; C. M. Norris et al. 2023; H. N. Smitha et al. 2025). We thus decided to develop a new methodology to perform joint stellar contamination and atmospheric retrievals on our observed TRAPPIST-1 e transmission spectra using GPs to incorporate our limited knowledge on the underlying data-generating process.
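To make the idea of temperature-based contamination models concrete, the sketch below evaluates the commonly used single-heterogeneity contamination factor of the kind described in B. V. Rackham et al. (2018), in which an unocculted region covering a fraction of the disc rescales the transit depth. As a loudly stated simplification, blackbodies replace the stellar atmosphere models, which is even cruder than the 1D models the text finds insufficient; the temperatures and covering fraction are illustrative assumptions, not fitted values.

```python
import math

H = 6.62607015e-34   # Planck constant (J s)
C = 2.99792458e8     # speed of light (m / s)
KB = 1.380649e-23    # Boltzmann constant (J / K)


def planck(wavelength_um, temperature_k):
    """Blackbody spectral radiance at a wavelength given in microns."""
    lam = wavelength_um * 1e-6
    x = H * C / (lam * KB * temperature_k)
    return (2.0 * H * C ** 2 / lam ** 5) / math.expm1(x)


def contamination_factor(wavelength_um, t_phot, t_het, covering_fraction):
    """Multiplicative factor on the transit depth from unocculted heterogeneities."""
    ratio = planck(wavelength_um, t_het) / planck(wavelength_um, t_phot)
    return 1.0 / (1.0 - covering_fraction * (1.0 - ratio))


# Illustrative values only: a cool photosphere with 10% coverage of spots.
T_PHOT, T_COLD, T_HOT, F_HET = 2566.0, 2300.0, 2900.0, 0.1

cold_short = contamination_factor(0.8, T_PHOT, T_COLD, F_HET)
cold_long = contamination_factor(4.5, T_PHOT, T_COLD, F_HET)
hot_short = contamination_factor(0.8, T_PHOT, T_HOT, F_HET)
```

In this toy model cold regions inflate the transit depth and hot regions dilute it, with the effect shrinking toward longer wavelengths; the spectra discussed in this Letter show structure that such smooth temperature-contrast models do not reproduce.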
3.2.1. Atmospheric Inferences Using GPs
The complex structure of our observed transmission spectra points to opacity sources and/or physical mechanisms defining the emergent spectra of cold and hot “spots” (or even the photospheres of M dwarfs) that are not included in current publicly available stellar models (such as, e.g., magnetic field impacts on the spectra of “hot” spots; C. M. Norris et al. 2023) or even as of yet unidentified systematic effects impacting all of our data reduction pipelines. This makes inferences on the possible atmospheric composition of TRAPPIST-1 e from our observed transmission spectra not straightforward to perform with either forward models, model grids, or atmospheric retrievals, as all of those that include models for stellar contamination parameterize its impact using models similar to the ones discussed above (see, e.g., R. J. MacDonald & N. E. Batalha 2023 and references therein).
At the low resolutions we are dealing with in this work, both stellar and exoplanetary spectra are expected to be relatively smooth functions with well-defined length scales. Motivated by this assumption, along with the versatility of GPs to model and account for unknown systematic trends in transit light curves with such properties, here we decide to model the unknown signals that distort our transmission spectra using these processes as well. The framework for incorporating GPs in atmospheric retrievals has already been introduced in the literature for transmission spectra in the case of additive signals that distort the spectra due to unknown systematic effects (see, e.g., G. Guilluy et al.
2024
; P. McCreery et al.
2025
; Y. Rotman et al.
2025
). The difference in incorporating it in our work, however, is that here we are interested in modeling a signal that is multiplicatively distorting our transmission spectra, as stellar contamination acts at first order multiplicatively on a transmission spectrum (see, e.g., B. V. Rackham et al. 2018 and references therein). Multiplicative GPs, however, are not straightforward to derive. To circumvent this problem, instead of modeling the observed transit depth of a visit at each wavelength bin, we model the logarithm of this transit depth as a sum of terms, which converts our multiplicative problem into an additive one. The terms in this sum account for the (visit-dependent, deterministic) stellar contamination signal that distorts the exoplanet atmospheric signal, a (visit-dependent) constant transit depth offset injected by the unknown zero-point of the stellar photosphere, a GP for each visit, and a white-noise component that incorporates the observed transit depth errors at each wavelength bin and also allows for a jitter term, added in quadrature to those error bars, to account for possible underestimated error bars on the observed transit depths. It is important to note that this methodology of modeling observables in logarithm to include multiplicative signals (either GPs or linear models) is not new and is already employed in the photometric time-series literature (see, e.g., N. Espinoza et al.
2019
; I. C. Weaver et al.
2020
; N. H. Allen et al.
2022
and references therein).
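As a minimal sketch of why working in logarithms helps, the following Python snippet builds a toy transit depth as the product of an atmospheric signal, a contamination factor, and a smooth distortion, and then writes its logarithm as a sum of terms. All function names, symbols, and numbers are hypothetical illustrations chosen for this example; they are not the notation, values, or code used in this work.

```python
import numpy as np

def log_depth_model(planet_depth, contamination, offset, gp_draw):
    """Additive model for the logarithm of a transit depth (toy version).

    planet_depth  : atmospheric transit depth per wavelength bin
    contamination : deterministic multiplicative stellar contamination factor
    offset        : constant offset (in log space) for the visit
    gp_draw       : additive log-space term (stand-in for a GP realization)
    """
    return np.log(contamination) + np.log(planet_depth) + offset + gp_draw

# Hypothetical toy numbers (not taken from the paper)
wavelength = np.linspace(0.6, 5.0, 30)              # microns
planet_depth = np.full_like(wavelength, 5.0e-3)     # flat 5000 ppm spectrum
contamination = 1.0 + 0.05 * np.sin(wavelength)     # +/-5% distortion
gp_draw = 0.01 * np.cos(2.0 * wavelength)           # smooth log-space wiggle
offset = 0.02

log_depth = log_depth_model(planet_depth, contamination, offset, gp_draw)

# Exponentiating recovers a purely multiplicative model for the depth itself
depth = planet_depth * contamination * np.exp(offset + gp_draw)
```

In this toy setup, an additive GP in log space corresponds to a multiplicative distortion of the transit depth, which is the property the text relies on.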
We provide the details of our GP atmospheric retrieval methodology—including the priors used in our retrievals—in Appendix
. In short, our retrieval framework uses dynamic nested sampling via the
dynesty
library (J. S. Speagle
2020
) to explore the parameter space, using the
POSEIDON
library (R. J. MacDonald & N. Madhusudhan
2017
; R. J. MacDonald
2023
) to perform the radiative transfer and compute forward models
). Following Z. Lin et al. (
2021
) and J. Lustig-Yaeger et al. (
2023
), we consider H₂, CO₂, CH₄, H₂O, N₂, O₂, O₃, N₂O, and CO as the spectrally active species in this modeling framework. We follow B. Benneke & S. Seager (2012) and use a centered log-ratio transformation to model the mixing ratios in order to consider any combination of the spectrally active molecules in our retrievals to be the background gas. We include the impact of clouds via a cloud-top pressure parameter, which we here interpret as an effective surface pressure for the atmospheres under study; we also allow for the reference pressure to be a free parameter in our retrievals. The deterministic part of the stellar contamination signal is built following the formalism in B. V. Rackham et al. (
2018
), for which we use BT-SETTL models (F. Allard
2014
) to incorporate both hot and cold stellar heterogeneities. For our GP (which we interpret as the stochastic part of our stellar contamination model), we use a Matérn 3/2 kernel via the
george
library (S. Ambikasaran et al.
2014
). We also experimented with a squared-exponential kernel and obtained the same results as those we showcase below with the Matérn 3/2 kernel.
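The centered log-ratio transformation mentioned above can be illustrated with a short Python sketch. This is a generic textbook implementation written for this example, not the retrieval code used in this work; the function names and the three-gas composition are hypothetical.

```python
import numpy as np

def clr(mixing_ratios):
    """Centered log-ratio transform of a composition that sums to 1."""
    log_x = np.log(np.asarray(mixing_ratios, dtype=float))
    return log_x - log_x.mean()

def clr_inverse(xi):
    """Map centered log-ratio coordinates back to mixing ratios summing to 1."""
    xi = np.asarray(xi, dtype=float)
    w = np.exp(xi - xi.max())   # subtract the max for numerical stability
    return w / w.sum()

# Hypothetical three-gas composition (illustrative values only)
x = np.array([0.78, 0.21, 0.01])
xi = clr(x)
x_back = clr_inverse(xi)
```

Because the transformed coordinates treat every gas symmetrically, no single species has to be designated in advance as the background gas.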
3.3. GP Retrieval Results
We explored retrievals of different complexity in how the stellar contamination model is defined in our framework. We tried two-component (i.e., a spot and a photosphere) and three-component (i.e., a “hot” spot, a “cold” spot, and a photosphere) stellar contamination models, with and without the GP components. We also tried models in which, instead of having a different stellar contamination model per visit, a global stellar contamination model, common to all visits, was defined. From all those combinations, the models with the largest log evidences when compared to all the other model combinations were two models that set the “deterministic” stellar contamination model to 1 (i.e., no deterministic stellar contamination component) and that absorb the large, hundreds of ppm variations likely coming from stellar contamination in our observed transmission spectra into the GP. The first was a model with a GP term per visit and an exoplanet atmosphere on TRAPPIST-1 e, while the second was a model with a GP term per visit with a featureless spectrum per visit (i.e., with the exoplanet atmospheric signal set to zero). Both models and the corresponding combined transmission spectrum, corrected by the GP components, are presented in Figure 3.
Figure 3.
The transmission spectra of TRAPPIST-1 e interpreted with GPs and atmospheric/atmosphereless models. (Top) Transmission spectra on our four visits (black points with error bars) modeled with a GP times either an atmospheric model (blue) or a flat-line spectrum (i.e., with no atmosphere or with a high-altitude cloud deck; orange); a GP (offset; dashed lines) acts multiplicatively to distort those signals. Bands represent the 1σ and 3σ credibility bands. (Bottom) Visit-combined transmission spectrum obtained by (weighted) averaging the four visits after correcting for the modeled GP component (using the flat-line model-derived GP; black points with error bars). The atmospheric model and the flat-line model are indistinguishable according to the Bayesian evidence; more data are needed to distinguish between them. Bands represent the 1σ and 3σ credibility bands. Note how, within the error bars, an Earth-like model (gray; with the locations of the main active spectroscopic features) is still consistent with our data. Also note that the blue and orange models are shared but fitted to each individual visit.
One of the striking features of the GP-corrected spectra presented in Figure 3 is the precision that we achieve in our four-visit-combined (via a weighted mean) transmission spectrum (bottom panel): we are able to unveil a spectrum with error bars on the order of 50 ppm at R = 30 in the 0.6–5 μm range, which significantly expands, both in precision and in wavelength coverage, prior HST/WFC3 constraints on the transmission spectrum of TRAPPIST-1 e (from, e.g., J. de Wit et al.
2018
; Z. Zhang et al.
2018
). The difference in log evidence between models that include exoplanetary atmospheric features (blue model in Figure 3) and those that do not (orange model) favors the no-atmosphere model when using all the molecules (27 total free parameters). However, restricting the retrieval to only H₂, CO₂, CH₄, H₂O, and CO (the molecules we expect to show the highest impact on our NIRSpec/PRISM transmission spectra) raises the Bayesian evidence of the atmospheric model to a level that makes it indistinguishable from the featureless model (with the difference still in favor of the featureless model). Based on those analyses, we find that, given our current data, we are unable to distinguish between models containing exoplanet atmospheric features and models that are featureless.
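To illustrate how combining visits with a weighted mean improves precision, the sketch below implements an inverse-variance weighted mean in Python. The choice of inverse-variance weights is an assumption made for this example (the text only states that a weighted mean is used), and the depths and errors are made-up numbers.

```python
import numpy as np

def weighted_mean_spectrum(depths, errors):
    """Inverse-variance weighted mean over visits.

    depths, errors : arrays of shape (n_visits, n_wavelengths)
    Returns the combined depth and its 1-sigma uncertainty per wavelength.
    """
    depths = np.asarray(depths, dtype=float)
    weights = 1.0 / np.asarray(errors, dtype=float) ** 2
    combined = np.sum(weights * depths, axis=0) / np.sum(weights, axis=0)
    combined_err = 1.0 / np.sqrt(np.sum(weights, axis=0))
    return combined, combined_err

# Hypothetical example: four visits, two wavelength bins, 100 ppm errors
depths = np.array([[5000.0, 5100.0],
                   [5050.0, 5000.0],
                   [4950.0, 5050.0],
                   [5000.0, 5050.0]])   # ppm
errors = np.full_like(depths, 100.0)    # ppm
mean, err = weighted_mean_spectrum(depths, errors)
```

With equal errors on four visits, the combined uncertainty is half the single-visit uncertainty, which is the usual square-root-of-N gain.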
3.4. Constraints on Possible Atmospheric Compositions of TRAPPIST-1 e
Our precise NIRSpec/PRISM spectra, together with our GP retrieval methodology, allow us to put novel constraints on the possible atmospheric compositions of TRAPPIST-1 e. To illustrate the power of these constraints—and the improvement over previous measurements—we applied the same GP retrieval methodology introduced above to the previous state of the art in transmission spectroscopy for TRAPPIST-1 e: the two HST/WFC3 visits introduced in J. de Wit et al. (
2018
) for this exoplanet, reanalyzed and studied in detail in the work of Z. Zhang et al. (
2018
). We provide details of our retrieval framework applied to the HST/WFC3 data presented in Z. Zhang et al. (
2018
) in Appendix
. For illustration and comparison, we present constraints on possible primary, H₂-dominated atmospheres with our methodology using both these two visits and the four JWST/NIRSpec visits introduced in this work in Figure 4.
Figure 4.
H₂ abundance constraints for TRAPPIST-1 e from HST and JWST as a function of surface pressure. Posterior distribution showcasing the improvement on constraints on possible H₂-dominated atmospheres on TRAPPIST-1 e between HST (left in gray; obtained by applying our GP retrieval methodology to the HST/WFC3 data in Z. Zhang et al. 2018) and JWST (right in blue; obtained by applying it to the four NIRSpec/PRISM transits presented in this work). The distribution for HST mainly follows the centered log-ratio prior, allowing the H₂-dominated solution at virtually all pressures ≳1 bar; the JWST one disfavors the H₂-dominated solution.
Studying the HST/WFC3 marginal posterior distribution of the H₂ abundance for TRAPPIST-1 e (i.e., the gray histogram in Figure 4), we are in fact unable to rule out H₂-dominated atmospheres at the 3σ level. However, we are able to rule out abundances larger than about 80% by volume with our four JWST/NIRSpec visits (blue posterior distributions), even in cloudy/low-pressure scenarios, at more than a 3σ level. Only 1% of the posterior samples, in fact, allow for H₂ abundances larger than 50% by volume when using our JWST/NIRSpec data, making the H₂-dominated scenario (and thus primary atmosphere scenarios for TRAPPIST-1 e) very unlikely given our JWST/NIRSpec data even in the presence of clouds. A detailed presentation and study of our posterior constraints on possible secondary
atmospheres for TRAPPIST-1 e as inferred from our GP retrieval framework are presented in a companion paper (A. Glidden et al.
2025
).
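Statements such as "only 1% of the posterior samples allow abundances larger than 50%" amount to computing a (possibly weighted) fraction of posterior samples above a threshold. The Python sketch below shows one way to do this; the function name, the optional importance weights, and the randomly generated samples are hypothetical and only serve as an illustration.

```python
import numpy as np

def fraction_above(samples, threshold, weights=None):
    """Posterior probability that a parameter exceeds a threshold."""
    samples = np.asarray(samples, dtype=float)
    if weights is None:
        weights = np.ones_like(samples)
    weights = np.asarray(weights, dtype=float)
    return np.sum(weights[samples > threshold]) / np.sum(weights)

# Hypothetical posterior samples of log10(H2 volume mixing ratio)
rng = np.random.default_rng(0)
log10_h2 = rng.uniform(-12.0, 0.0, size=10_000)
p_h2_above_half = fraction_above(10.0 ** log10_h2, 0.5)
```

The optional weights are included because nested-sampling outputs are often stored as weighted samples; equally weighted samples can simply omit them.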
4. Discussion
The JWST/NIRSpec PRISM transmission spectrum presented in this work for the HZ exoplanet TRAPPIST-1 e is one of the most precise measurements and constraints on the atmospheric composition of a rocky HZ exoplanet to date. Among the key lessons from our observations is that stellar contamination, which distorts our observed transmission spectrum due to unocculted hot and cold “spots” on the star, is one of the biggest challenges when it comes to inferring atmospheric properties of the exoplanets in the TRAPPIST-1 system. While this has been shown to be the case as well with NIRISS/SOSS observations at wavelengths <3 μm (see, e.g., O. Lim et al.
2023
; M. Radica et al.
2025
), our work reveals that this might be a problem even for longer wavelengths, where an important number of strong possible absorbers (such as, e.g., CH₄ and CO₂; see Figure
, right panel) are located for temperate, rocky worlds like TRAPPIST-1 e. We do showcase, however, that using data-driven methodologies such as GPs can aid in modeling signals that our stellar models might not be yet ready to account for, allowing us to perform inferences on the transmission spectra which include constraints on the possible atmospheric compositions of TRAPPIST-1 e. A detailed overview of the physical constraints our observations put on possible secondary atmospheric compositions for TRAPPIST-1 e is presented in a companion paper (A. Glidden et al.
2025
); below, we discuss some insights we can extract from our presented observations and methodology, as well as future prospects for further characterization of TRAPPIST-1 e.
4.1. Stellar Contamination beyond 3 μm
One of the most striking features of our observed transmission spectra is their strong epoch-to-epoch wavelength-dependent variations. Intuitively, stellar contamination appearing at wavelengths <3 μm at the level observed by prior work (O. Lim et al.
2023
; M. Radica et al.
2025
) is expected as the result of possible water bands evolving in cold and hot spots on the stellar surface. However, the strong variation at longer wavelengths observed in our transmission spectra, along with the inability of stellar models to properly fit the observed variations, might seem to run counter to the intuition that stellar contamination should be smaller at wavelengths past 3 μm, where water bands might no longer be strong opacity sources.
Predictions for stellar contamination for late M dwarfs such as TRAPPIST-1, however, do point out that depending on the nature of the spots, the impact at longer wavelengths might not be negligible (see, e.g., B. V. Rackham et al.
2018
and references therein). In addition, recent work on modeling the emergent flux of hot and cold spots has also highlighted the need to incorporate the impact of magnetohydrodynamic effects to properly model it (V. Witzke et al.
2022
; C. M. Norris et al.
2023
). Another line of evidence for expecting variability and stellar contamination at longer wavelengths might also come from recent JWST variability studies of brown dwarfs. Although these objects are colder than TRAPPIST-1, the recent variability monitoring of the brown dwarfs WISE 1049AB and SIMP 0136+0933, over the same wavelength range as our TRAPPIST-1 observations, also reveals ample variability on timescales of hours at wavelengths >3 μm, mainly driven by CH₄ and CO variability on those objects (B. A. Biller et al.
2024
; A. M. McCarthy et al.
2025
). While there is no one-size-fits-all explanation as to the nature of this brown dwarf variability, this is thought to be driven by a complex mixture of various nonequilibrium processes, including condensation and vertical mixing, as well as sampling emergent flux that might be a mixture of contributions from different pressure levels in their atmospheres. It is not unthinkable for TRAPPIST-1 to be developing a similar level of complex physical processes on its surface, which might complicate its modeling even further.
As demonstrated in our work, methodologies such as GPs exist to marginalize over “unknown” time-variable signals, which in this work (and in our companion paper; A. Glidden et al.
2025
) allowed us to perform various inferences on the transmission spectrum of TRAPPIST-1 e—including constraints on its possible atmospheric makeup. This methodology, however, has its own limitations. Implicit in our modeling framework introduced in Section
and Appendix
, for instance, is the idea that any time-varying signal in the transmission spectrum of TRAPPIST-1 e comes from the star, while any static signal across visits comes from the exoplanetary atmosphere. TRAPPIST-1, however, might possess persistent heterogeneities observable in all the visits, which might also be biasing our exoplanet atmospheric inferences on TRAPPIST-1 e. This is one of the limitations of the presented framework and one we leave to incorporate in future work.
Techniques such as those that suggest, e.g., to use TRAPPIST-1 b as a proxy for stellar contamination at all wavelengths and then use that to “decontaminate” the transmission spectrum of other planets such as that of TRAPPIST-1 e are excellent complementary techniques that could help remove contamination including both time-varying and persistent components in a “model-independent” way (TRAPPIST-1 JWST Community Initiative et al.
2024
; A. D. Rathcke et al.
2025
). The observations of JWST GO 6456 and 9256 (N. Allen et al.
2024
), which attempt to use this technique over 15 transits of TRAPPIST-1 b and e, will be a perfect data set to test the GP retrieval methodology introduced in this work, as persistent features should be present in both transmission spectra—even if time-variable ones vary between close transits of TRAPPIST-1 e and TRAPPIST-1 b. Given the large number of observations, this might also be a perfect program to study possible physical variability mechanisms that might explain the level and wavelength variability on TRAPPIST-1 as well.
4.2. Primary Atmosphere Constraints on TRAPPIST-1 e
While with our current data, we are unable to distinguish whether TRAPPIST-1 e has an atmosphere or not, our precise four-visit JWST/NIRSpec PRISM spectra, combined with our GP methodology, as introduced in Section
, allow us to put novel constraints on possible compositions for TRAPPIST-1 e if it were to have an atmosphere.
As showcased in Section
, our work puts particularly strict limits on possible primary, H₂-dominated atmospheres present in TRAPPIST-1 e. Prior works attempting to constrain its atmospheric composition were only able to rule out cloud-free, H₂-dominated atmospheres (see, e.g., J. de Wit et al. 2018). Cloudy H₂-dominated atmospheres were still allowed, however, as the wavelength range of HST/WFC3 was unable to constrain the amplitude of CH₄ and CO₂ features (located mostly at wavelengths >2 μm), which in such a scenario would be very large. On top of this, given stellar activity, studies such as the one from Z. Zhang et al. (
2018
) suggested that the transmission spectra might not be as constraining for TRAPPIST-1 e’s exoplanet atmosphere as previously thought—being, in turn, well explained instead by arising fully from stellar contamination. Applying our GP retrieval methodology to the HST/WFC3 data presented in Z. Zhang et al. (
2018
), we indeed reach the same conclusions as those previous works: at cloud-top pressures above about 1 bar, H₂-dominated scenarios are all fully consistent with the data at the 1σ–2σ level.
Using our four JWST NIRSpec/PRISM visits together with our GP retrieval methodology, however, we were able to showcase that it is very likely that TRAPPIST-1 e does not possess an H₂-dominated atmosphere even in cloudy scenarios and even in the face of stellar contamination. As we show in Section
, the probability that TRAPPIST-1 e has an H₂ volume mixing ratio larger than 50% is less than 1% given our data and modeling framework. This allows us to rule out mixing ratios larger than about 80% at more than the 3σ level. These results are, in turn, in agreement with predictions from Y. Hori & M. Ogihara (
2020
) that suggest this hydrogen-dominated scenario to be unlikely from hydrodynamic escape calculations. Interestingly, the highest-probability regions of the H₂ mixing ratios inferred from our JWST retrievals (Figure 4, right panel) are consistent with those found or predicted to be on Earth, Mars, and Venus, i.e., of order 10⁻⁶–10⁻⁹ by volume (D. H. Ehhalt et al.
1977
; J. D. Patterson et al.
2020
; A. Kleinböhl et al.
2024
; Z. Wang et al.
2025
). A detailed study of the constraints our observations imply for possible secondary atmospheres such as the ones on those planets as traced by other molecules (such as, e.g., CO₂ and CH₄), including constraints on mean molecular weights, is presented in a companion paper (A. Glidden et al.
2025
).
5. Conclusion
In this work, we present four JWST/NIRSpec PRISM transmission spectra of TRAPPIST-1 e obtained in mid-to-late 2023. We show that these transmission spectra, rather than being featureless, exhibit significant variability in both time and wavelength. We interpret this variability as arising from stellar heterogeneities in the host star, TRAPPIST-1—i.e., due to the transit light source effect (B. V. Rackham et al.
2018
). While we can qualitatively explain the observed features in those spectra as arising from possible hot and cold spots on the stellar photosphere, we are unable to fit the observed spectroscopic variations with stellar model atmospheres alone. In order to perform inferences on our transmission spectra and put constraints on the possible atmospheric makeup of TRAPPIST-1 e, we resort to using GPs to model the stellar contamination in our spectra, which allows us to perform joint exoplanet atmospheric retrievals on our data and put new constraints on the possible atmospheric compositions of TRAPPIST-1 e.
We show that with the current data set, we are unable to distinguish between an atmosphere and an atmosphereless scenario for TRAPPIST-1 e, despite being able to constrain atmospheric features down to ∼50 ppm at R = 30 in the 0.6–5 μm range. This level of precision does allow us, however, to rule out possible cloudy H₂-dominated atmospheres, with which we conclude that primary atmospheres are unlikely on TRAPPIST-1 e. A detailed study of the possible secondary atmospheres on TRAPPIST-1 e and how they compare to our own solar system objects is presented in a companion paper (A. Glidden et al.
2025
). We note how the observations of JWST GO 6456 and 9256 (N. Allen et al.
2024
) will be critical to constrain both stellar contamination using methodologies such as the one introduced in this work and others introduced in the literature (TRAPPIST-1 JWST Community Initiative et al.
2024
; A. D. Rathcke et al.
2025
) and the possible atmospheric makeup of TRAPPIST-1 e. Our work does highlight, however, how JWST is breaking ground in the study of rocky HZ exoplanet atmospheric compositions.
Acknowledgments
We thank the anonymous referee for their helpful and timely comments, which improved this manuscript. Some/all of the data presented in this Letter were obtained from the Mikulski Archive for Space Telescopes (MAST) at the Space Telescope Science Institute. The specific observations analyzed can be accessed via doi:
10.17909/yzwd-vq54
. All figures in this Letter, along with the associated data, can be accessed at doi:
10.5281/zenodo.16125662
This Letter reports work carried out in the context of the JWST Telescope Scientist Team (
; PI: M. Mountain). Funding is provided to the team by NASA through grant 80NSSC20K0586. This work is based on observations made with the NASA/ESA/CSA James Webb Space Telescope. The data were obtained from the Mikulski Archive for Space Telescopes at the Space Telescope Science Institute, which is operated by the Association of Universities for Research in Astronomy, Inc., under NASA contract NAS 5-03127 for JWST. These observations are associated with program #1331 (PI: Lewis). N.H.A. acknowledges support by the National Science Foundation Graduate Research Fellowship under grant No. DGE1746891. This material is based upon work performed as part of the Consortium on Habitability and Atmospheres of M-dwarf Planets (CHAMPs) team, supported by the National Aeronautics and Space Administration (NASA) under grant No. 80NSSC23K1399 issued through the Interdisciplinary Consortia for Astrobiology Research (ICAR) program. C.I.C. acknowledges support by NASA Headquarters through an appointment to the NASA Postdoctoral Program at the Goddard Space Flight Center, administered by ORAU through a contract with NASA. D.R.L. acknowledges support from NASA under award number 80GSFC24M0006.
Facility:
JWST - James Webb Space Telescope (NIRSpec).
Software:
JWST Calibration Pipeline (H. Bushouse et al.
2022
), Eureka (T. J. Bell et al.
2022
), transitspectroscopy (N. Espinoza
2022
), ExoTiC-LD (D. Grant & H. R. Wakeford
2024
), ExoTiC-JEDI (L. Alderson et al.
2022
).
Appendix A: JWST Data Reduction
In what follows, we present all the data reduction steps and pipelines used to obtain the transmission spectrum for TRAPPIST-1 e for our four transit observations. In total, we compared our data reduction procedures across five different data reduction pipelines. A summary with all the median-subtracted transmission spectra is presented in Figure A1, where we showcase how all our data reduction procedures obtain very similar transmission spectra on our different visits.
Figure A1
TRAPPIST-1 e NIRSpec/PRISM transmission spectra on different epochs for different pipelines. Analogous to Figure
but comparing the transmission spectra obtained by different pipelines/reductions in our team. Note that reductions with the same pipelines by different individuals are plotted with similar colors (i.e.,
transitspectroscopy
reductions by NE and AG are in blue,
Eureka!
reductions by SC/KS and CC are in orange/red,
ExoTiC
reductions are in green). This is done on purpose for ease of comparison between varying parameters within a given pipeline and comparing reductions with truly different pipelines. Note that we compare the median-subtracted transmission spectrum across different pipelines, as our inference described in Section
does not depend on the absolute transit depth level but on the
shape
of the transmission spectra.
We identify each data reduction first by the name of the main pipeline being used, followed by the initials of the coauthor(s) performing the reduction. NE and AG use the
transitspectroscopy
pipeline. SC/KS and CC use the
Eureka!
pipeline. DG uses the
ExoTiC
pipeline. We quantified the agreement between the pipelines by subtracting the relative transmission spectra of NE from the ones produced by SC/KS and DG and calculating the p-value of a chi-square test on this difference. For all of them, we find p-values > 0.4, which is quantitative evidence that the spectra are in agreement with each other.
In all the descriptions below, columns refer to pixels that follow the wavelength direction of the spectra, and rows are pixels in the cross-dispersion direction.
A.1.
transitspectroscopy
, NE Reduction
The data were reduced using the
transitspectroscopy
pipeline version 0.4.1 (N. Espinoza
2022
), which in turn makes use of the
JWST Calibration Pipeline
(H. Bushouse et al.
2022
). In particular, for the analysis presented in this work, the
JWST Calibration Pipeline
version 1.12.5 was used. The
transitspectroscopy
pipeline starts from the uncalibrated data products for each observation (
*uncal.fits
files) and obtains the rates for each integration using the
JWST Calibration Pipeline
’s Stage 1 with five modifications. (1) We skip the dark current correction, as the dark current counts are negligible for our short group integrations. (2) Instead of using the default saturation reference file in the pipeline, we use a modified one that sets the saturation level at 90% of the level in those files, as this is where we see evident deviations of the ramps from linearity even after nonlinearity corrections; the main impact of this change is that fewer outliers are observed in the high-resolution transmission spectrum of TRAPPIST-1 e, which translates into better error bars, in particular at wavelengths between 1 and 2 μm. (3) After the superbias step, instead of running the reference pixel step (as NIRSpec/PRISM does not have reference pixels) for each group, we take the median of the leftmost 25 columns and the rightmost 25 columns of the spectra (which contain negligible counts from the stellar spectrum), and we remove this value from the entire group to remove group-to-group pedestal changes. (4) Background is removed on a group-to-group basis by taking the median at each column of the top 2 pixels and the bottom 2 pixels and removing that value from each column. (5) We perform our own jump detection for each group as follows. First, we calculate the difference between the fluence $F_{g,i}$ at group $g$ and the corresponding one at group $g + 1$ for all integrations $i$, i.e., $\Delta_{g,i} = F_{g+1,i} - F_{g,i}$. Then, a median filter $M(\Delta_{g,i})$ with a window of $w = 200$ integrations is subtracted from this difference time series, giving $R_{g,i} = \Delta_{g,i} - M(\Delta_{g,i})$. The median-absolute-deviation-based standard deviation $\sigma_g$ of this time series $R_{g,i}$ is calculated, and any values deviating by more than $10\sigma_g$ from it are flagged as jumps in a given group. This process is repeated for $g$ = 1, 2, 3, 4. After this procedure, the rates per integration are obtained using the ramp-fitting algorithm in the
JWST Calibration Pipeline
. To extract the spectrum from those rates per integration, the spectrum is first traced on each integration by finding centroids via a cross-correlation of the profile at each column with a Gaussian; the centroids are then fitted with a B-spline using eight equally spaced knots between pixel columns 51 and 491. The median of all of those traces is used as the trace for all integrations. Using this, background is removed on each column by removing the median of all pixels more than 10 pixels away from the center of the trace, and 1/f noise is removed following the procedure outlined in M. Radica et al. (
2023
). Finally, the spectrum for each integration is extracted via simple extraction using an aperture with a radius of 7 pixels from the center of the trace.
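The jump-detection step (5) described above can be sketched in Python as follows. This is an illustrative reimplementation written for this example, not the code in the `transitspectroscopy` pipeline: the function names are hypothetical, groups are 0-indexed, the running median uses edge padding (an assumption about how the ends of the time series are handled), and the simulated ramps are made up.

```python
import numpy as np

def running_median(x, window):
    """Running median with edge padding; output has the same length as x."""
    left = window // 2
    right = window - 1 - left
    padded = np.pad(x, (left, right), mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, window)
    return np.median(windows, axis=1)

def flag_jumps(fluence, group, window=200, n_sigma=10.0):
    """Flag integrations with a jump between `group` and `group + 1`.

    fluence : array of shape (n_integrations, n_groups) for one pixel
    """
    diff = fluence[:, group + 1] - fluence[:, group]
    resid = diff - running_median(diff, window)
    # MAD-based estimate of the standard deviation (Gaussian scaling)
    sigma = 1.4826 * np.median(np.abs(resid - np.median(resid)))
    return np.abs(resid) > n_sigma * sigma

# Hypothetical ramps: 1000 integrations, 5 groups, unit Gaussian noise
rng = np.random.default_rng(1)
n_int, n_groups = 1000, 5
fluence = 100.0 * np.arange(n_groups) + rng.normal(0.0, 1.0, (n_int, n_groups))
fluence[500, 3:] += 50.0   # injected jump between groups 2 and 3
flags = flag_jumps(fluence, group=2)
```

Working on group-to-group differences across integrations, rather than along a single ramp, makes it possible to flag outliers even when each ramp has only a handful of groups.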
The transmission spectrum of the planet for each visit is obtained by fitting the wavelength-dependent transit light curves. Only the portions within 1 hr from midtransit of the light curves are fitted, and these are, in turn, binned from the original 1.38 s cadence to 13.8 s cadence light curves by binning them by a factor of 10 in time. The light-curve fits are performed using the
juliet
library (N. Espinoza et al.
2019
). This fits a
batman
transit light-curve model (L. Kreidberg
2015
) at each wavelength where all parameters are fixed to the ones found in the white-light light-curve analysis described in Section
3.1
, except for the planet-to-star radius ratio. Quadratic limb-darkening coefficients are obtained for TRAPPIST-1 e using a 2566 K,
, and solar-metallicity PHOENIX model via the
limb-darkening
library and through the MC-SPAM algorithm as discussed in N. Espinoza & A. Jordán (
2015
). To account for systematic effects, a GP using the
celerite
(D. Foreman-Mackey et al.
2017
) library is fitted to each visit and wavelength: a Matérn 3/2 kernel in time with a timescale fixed to that found in the white-light light-curve analysis for each visit described in Section
3.1
is used, with the amplitude of the GP being fitted to each wavelength. A jitter term is fitted and added in quadrature to the photometric errors estimated from the pipeline. The sampling is performed with dynamic nested sampling using the
dynesty
package (J. S. Speagle
2020
).
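For readers unfamiliar with the noise model described above, the following Python sketch builds a dense covariance matrix from an exact Matérn 3/2 kernel, adds the jitter in quadrature to the reported errors on the diagonal, and evaluates the corresponding Gaussian log-likelihood. It is only an illustration: it does not use `celerite` (which evaluates an approximation to this kernel with a much faster solver), and the function names and numerical values are hypothetical.

```python
import numpy as np

def matern32_cov(t, amplitude, timescale, yerr, jitter):
    """Covariance matrix: Matern 3/2 kernel plus white noise on the diagonal."""
    t = np.asarray(t, dtype=float)
    r = np.abs(t[:, None] - t[None, :]) / timescale
    kernel = amplitude**2 * (1.0 + np.sqrt(3.0) * r) * np.exp(-np.sqrt(3.0) * r)
    # Jitter is added in quadrature to the reported errors
    white = np.asarray(yerr, dtype=float) ** 2 + jitter**2
    return kernel + np.diag(white)

def gp_log_likelihood(residuals, cov):
    """Gaussian log-likelihood of residuals (data minus transit model)."""
    chol = np.linalg.cholesky(cov)
    alpha = np.linalg.solve(chol, residuals)
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))
    n = residuals.size
    return -0.5 * (alpha @ alpha + log_det + n * np.log(2.0 * np.pi))

# Hypothetical light-curve residuals within 1 hr of midtransit
t = np.linspace(-1.0, 1.0, 50)       # hours from midtransit
yerr = np.full(t.size, 200e-6)       # reported errors (relative flux)
cov = matern32_cov(t, amplitude=100e-6, timescale=0.5, yerr=yerr, jitter=50e-6)
loglike = gp_log_likelihood(np.zeros(t.size), cov)
```

In a setup like the one described in the text, the timescale would be held fixed and only the amplitude and jitter would be sampled at each wavelength.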
A.2.
transitspectroscopy
, AG Reduction
A second
transitspectroscopy
reduction was performed by AG. The processing of the uncalibrated data products followed steps similar to those of the NE reduction, with the difference that the default saturation reference file was used. The tracing and spectral extraction were the same as in the NE reduction, except that an extraction aperture of 10 pixels was used. The light-curve fitting setup was the same as the one used for the NE reduction, with the key differences being that (1) no binning in time is performed on the light curves and (2) the GP amplitude is fixed to that found on white-light light-curve fits, and the timescale is fitted at each wavelength. The motivation for the latter is to test the inverse of the methodology used by the NE reduction with the same pipeline (which fixes the timescale and fits for the GP amplitude). As is shown above, both spectra are nearly identical, showcasing that this assumption is not a particularly important one when it comes to retrieving the transmission spectrum.
A.3.
Eureka!
, SC/KS Reduction
The data were also reduced using the
Eureka!
pipeline (T. J. Bell et al.
2022
) version 0.9, which also makes use of the
JWST Calibration Pipeline
. This particular reduction followed Stage 1 similarly to the
JWST Calibration Pipeline
, with two main differences: (1) the jump step is skipped and (2) group-level background subtraction is performed prior to ramp fitting using pixels at the top and bottom of each column to remove both background counts and 1/f noise. Then, the standard ramp-fitting algorithm from the
JWST Calibration Pipeline
is used to obtain the rates per integration. To trace the spectra, a Gaussian was fitted to each column, and its fitted mean was used as the center of the trace at each column. This was done for pixel columns 26–451. The spectrum was extracted via optimal extraction using a 3 pixel radius from the center of the trace.
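The group-level background subtraction described above, which uses pixels at the top and bottom of each column, can be sketched in Python as follows. This is a generic illustration and not the `Eureka!` implementation: the function name is hypothetical, and the number of edge pixels (two on each side, borrowed from the NE reduction described earlier) is an assumption for this reduction.

```python
import numpy as np

def subtract_column_background(frame, n_edge=2):
    """Subtract, per column, the median of the top and bottom `n_edge` rows.

    frame : 2D array of shape (n_rows, n_columns); rows run along the
            cross-dispersion direction, columns along wavelength.
    """
    edge_pixels = np.concatenate([frame[:n_edge], frame[-n_edge:]], axis=0)
    background = np.median(edge_pixels, axis=0)   # one value per column
    return frame - background[None, :]

# Hypothetical frame: column-dependent background plus a central trace
n_rows, n_cols = 32, 8
offsets = np.arange(n_cols, dtype=float)
frame = np.tile(offsets, (n_rows, 1))
frame[14:18, :] += 1000.0      # stellar trace in the middle rows
cleaned = subtract_column_background(frame)
```

Because one value is estimated and removed per column of each group, the same operation removes any signal that is constant along a column in that group, in addition to the sky background.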
The transmission spectrum of the planet was obtained by first binning the flux in wavelength so as to extract 46 wavelength channels (i.e., by adding 9 pixels in the wavelength direction on each bin). Then, light curves were fitted with the T. J. Bell et al. (
2022
) utilities, which include a transit light-curve model fixing all orbital parameters and limb-darkening coefficients to the same ones used in the previously described reductions but leaving the planet-to-star radius ratio as a free parameter in the fits. In addition, a slope and intercept in time were added to the June visits, while a quadratic term was added for the visits of July and October. A multiplier was applied to the error bars of each light curve. In the transit of July 23, where a flare was observed close to midtransit, the flare event was masked out of the fit (i.e., the light curve was masked from about –0.4 to about –0.1 in Figure
).
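The wavelength binning step can be sketched as follows. This is an illustration under stated assumptions: the array layout (integrations by columns), the function name, and the choice to drop trailing columns that do not fill a bin are hypothetical, and the toy array has 414 columns simply because that yields 46 bins of 9 pixels.

```python
import numpy as np

def bin_columns(flux, flux_err, pixels_per_bin=9):
    """Sum flux over groups of adjacent columns.

    flux, flux_err : arrays of shape (n_integrations, n_columns).
    Uncertainties are combined in quadrature. Trailing columns that do
    not fill a complete bin are dropped (a choice of this sketch).
    """
    flux = np.asarray(flux, dtype=float)
    flux_err = np.asarray(flux_err, dtype=float)
    n_int, n_col = flux.shape
    n_bins = n_col // pixels_per_bin
    used = n_bins * pixels_per_bin
    shape = (n_int, n_bins, pixels_per_bin)
    binned = flux[:, :used].reshape(shape).sum(axis=2)
    binned_err = np.sqrt((flux_err[:, :used] ** 2).reshape(shape).sum(axis=2))
    return binned, binned_err

# Toy data: 5 integrations, 414 columns (46 bins of 9 pixels)
flux = np.ones((5, 414))
err = np.full((5, 414), 0.1)
binned, binned_err = bin_columns(flux, err)
```

Each row of binned is then one point in time for each of the 46 spectroscopic light curves.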
A.4. Eureka!, CC Reduction
A second Eureka! reduction was performed by CC following a setup similar to that of the SC/KS reduction. In the processing of uncalibrated files to rates per integration, the only difference in this reduction was that a custom bias (implemented in the Eureka! library) was used instead of the JWST Calibration Pipeline standard superbias frame. For spectral tracing, the spectra were traced from pixel columns 51 to 451. For the spectral extraction, optimal extraction was also used, but with an extraction aperture of 4 pixel radius around the trace, with background subtraction being performed using all pixels at a distance larger than 8 pixels from the trace on each column.
For the light-curve fitting, the setup was the same as that of SC/KS, with the differences that a quadratic term was used to account for systematic trends and that the light curves were fitted at pixel-level resolution.
A.5. ExoTiC, DG Reduction
The data were also reduced using the ExoTiC reduction framework, with the ExoTiC-JEDI package (L. Alderson et al. 2022) used to go from uncal.fits to 2D images and a modified version of ExoTiC-MIRI (D. Grant et al. 2023b) used for spectral extraction and light-curve fitting. The ExoTiC framework makes use of the JWST Calibration Pipeline version 1.8.2, where the dark current correction was skipped and 1/f subtraction was applied at the group level using a custom routine detailed in L. Alderson et al. (2023). The jump step was used with a threshold of 15σ, and the standard ramp-fitting algorithm of the JWST Calibration Pipeline was used. This reduction did not perform tracing on the rates per integration products and performed aperture extraction using pixel row 15 as the center, adding all pixel values within a total width of 11 pixels centered on this row. The wavelength-dependent light curves were then binned into 0.1 μm bins and fitted with a methodology similar to that of the transitspectroscopy reduction, with the difference being that instead of fitting a jitter term, a beta factor was used, representing a multiplicative factor on the uncertainties to account for excess white and red noise. All integrations were used to fit the light curves. For each light curve, limb-darkening coefficients were calculated using the ExoTiC-LD package (D. Grant & H. R. Wakeford 2024) using PHOENIX stellar models (T.-O. Husser et al. 2013) and applying the nonlinear four-parameter law, fixing all the coefficients to the model-computed values.
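For reference, the nonlinear four-parameter limb-darkening law is commonly written as I(μ)/I(1) = 1 − Σ c_k (1 − μ^(k/2)) for k = 1 to 4. The sketch below evaluates that standard form; it does not call ExoTiC-LD, and the coefficient values are hypothetical and chosen only for illustration.

```python
def nonlinear_limb_darkening(mu, c1, c2, c3, c4):
    """Four-parameter nonlinear law: intensity relative to disk center.

    mu is the cosine of the angle between the line of sight and the
    surface normal (1 at disk center, 0 at the limb).
    """
    coeffs = (c1, c2, c3, c4)
    return 1.0 - sum(
        c * (1.0 - mu ** (k / 2.0)) for k, c in enumerate(coeffs, start=1)
    )

# Hypothetical coefficients, for illustration only (not values from the text)
coeffs = (0.5, -0.2, 0.3, -0.1)
center = nonlinear_limb_darkening(1.0, *coeffs)
limb = nonlinear_limb_darkening(0.0, *coeffs)
```

At the limb the relative intensity reduces to one minus the sum of the coefficients, which is a quick sanity check on any set of model-computed values.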
Appendix B: Modeling Stellar Heterogeneities with Stellar Models
We test the use of both PHOENIX and BT-SETTL stellar models to perform inferences on the transmission spectrum from the NE reduction described above using exoretrievals (N. Espinoza et al. 2019), though overall we find that the BT-SETTL models are a better fit to the data. We chose not to use the SPHINX stellar models (A. R. Iyer et al. 2023) due to their low resolution (R = 250, lower than parts of the PRISM spectrum) and smaller temperature range (2000–4000 K). We fix log g = 5.2396 (E. Agol et al. 2021), interpolating between the log g = 5 and 5.5 stellar models to get the appropriate model. We consider hot spots of 2750–5000 K and cold spots of 2300–2450 K for the PHOENIX models (2300 K being as low as the PHOENIX model grids reach) and 1500–2450 K for the BT-SETTL models against a photospheric temperature of 2566 ± 26 K (E. Agol et al. 2021). We assume that all variations in transit depth with wavelength are due to stellar active regions, modeling the underlying planetary signal as a flat line. However, so that we are not potentially affected by the presence of underlying atmospheric signals, we perform our retrievals with two masks, (1) a 4–4.6 μm (CO₂) mask and (2) an above-3 μm (multiple potential species) mask, to test both their consistency and their ability to predict/match the stellar contamination in the masked wavelengths. Interestingly, this second mask acts as a test of how well modeling wavelengths similar to those covered with NIRISS/SOSS in the works of O. Lim et al. (2023) and M. Radica et al. (2025) for TRAPPIST-1 b and c, respectively, predicts the contamination at longer wavelengths.
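A flat planetary signal modulated by unocculted stellar heterogeneities can be sketched with the widely used contamination factor 1 / (1 − Σ f_i (1 − F_i/F_phot)), where f_i are covering fractions and F_i the spectra of the active regions. The example below is a rough illustration only: it uses blackbodies as a stand-in for the PHOENIX and BT-SETTL spectra, and the flat depth, function names, and spot parameters are assumptions of this sketch rather than the exoretrievals implementation.

```python
import numpy as np

H, C, KB = 6.62607015e-34, 2.99792458e8, 1.380649e-23  # SI constants

def planck(wavelength_um, temperature):
    """Blackbody spectral radiance (stand-in for a stellar model spectrum)."""
    lam = np.asarray(wavelength_um, dtype=float) * 1e-6
    return (2 * H * C**2 / lam**5) / np.expm1(H * C / (lam * KB * temperature))

def contaminated_depth(depth, wavelength_um, t_phot, spots):
    """Apply the contamination factor to a true (here flat) transit depth.

    spots : list of (covering_fraction, temperature) for unocculted regions.
    """
    f_phot = planck(wavelength_um, t_phot)
    denom = np.ones_like(f_phot)
    for fraction, t_spot in spots:
        denom -= fraction * (1.0 - planck(wavelength_um, t_spot) / f_phot)
    return depth / denom

wavelengths = np.linspace(0.6, 5.0, 50)  # microns
flat_depth = 5000e-6                     # hypothetical flat planetary signal
cold = contaminated_depth(flat_depth, wavelengths, 2566.0, [(0.05, 2200.0)])
hot = contaminated_depth(flat_depth, wavelengths, 2566.0, [(0.06, 2900.0)])
```

Even with blackbodies, cold regions raise the apparent depth and hot regions lower it, with the largest departures at the shortest wavelengths.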
We find that all four transmission spectra are consistent with the presence of significant stellar contamination (i.e., the Bayesian evidence for all our fits strongly prefers a stellar contamination model over a flat-line model for our observations). The first two visits (2023 June 22 and 28) are best fit by a small covering fraction (∼5%) of cold spots with all masks, with the strongest stellar contamination signal at the shortest wavelengths, though the effect is quite small throughout the wavelength range. The covering fractions and spot temperatures between these two visits are consistent with each other, which is interesting given that these two visits are separated by close to two rotation periods of TRAPPIST-1 (P_rot = 3.3 days; R. Luger et al. 2017; E. S. Dmitrienko & I. S. Savanov 2018; M. Brady et al. 2023), so we are observing the same portion of the stellar surface in both observations. Importantly, the best-fit spot temperature for the BT-SETTL model retrievals (2100–2200 K), which is the model preferred by the data according to the Bayesian evidence, is below the temperature limit of the PHOENIX grid, which shows that for cold M dwarfs, these generic stellar grids may not cover the entire parameter range necessary to model the observed features, at least in transmission spectra contaminated by stellar heterogeneities. The first and second visits are shown with the BT-SETTL model for the first visit (whose shape is consistent with that for the second visit) in Figure B1 (left panel). The retrieved contamination spectrum and characteristics are largely unchanged between the mask 1 and mask 2 tests.
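The rotation argument for the first two visits can be checked with simple arithmetic. The sketch below uses only the calendar dates and the 3.3 day rotation period quoted in the text, not the exact midtransit times, so the result is approximate.

```python
from datetime import date

rotation_period_days = 3.3     # rotation period quoted in the text
visit_1 = date(2023, 6, 22)
visit_2 = date(2023, 6, 28)

separation_days = (visit_2 - visit_1).days
n_rotations = separation_days / rotation_period_days
# Fraction of a rotation elapsed beyond the last completed turn
phase_offset = n_rotations % 1.0
```

With these inputs the separation comes out at about 1.8 rotations.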
Figure B1. TRAPPIST-1 e stellar contamination retrievals. Retrievals are carried out with exoretrievals on the NE reduction. All models shown use the BT-SETTL stellar models. Left panel: the first and second visits and the best-fit cold-spot model, which is approximately consistent between visits. Middle panel: the third visit, along with hot-spot models from both the mask 1 and mask 2 retrieval tests as described in the text. Here it can be seen that (a) stellar contamination predicted in the longer wavelengths from that seen in the shorter wavelengths does a poor job at matching the observations, and (b) stellar contamination models do a poor job of matching the 1–3 and 3–5 μm regions simultaneously. Right panel: the fourth visit with the best-fit hot-spot model. Similar to point (b) from the middle panel, the stellar contamination model cannot fit both the middle wavelengths and the longer wavelengths well.
The third visit, which has the flare event visible during the transit in Figure , and the fourth visit are instead consistent with a surface dominated by hot spots. For both stellar model grids, both visits are best fit by hot-spot temperatures of around 2900 K for mask 1, though the third visit is consistent with a higher covering fraction than the fourth (11% versus 6%). The fourth visit also agrees with this result for mask 2, but the third visit varies significantly for mask 2. Rather than the parameters found above, fitting only the shorter wavelengths for the third visit instead prefers a very small spot covering fraction with a much higher temperature (1% covering of 4100 K spots), which does a very poor job of matching the contamination seen in the longer, masked wavelengths. This shows us that predicting stellar contamination in the longer wavelengths from its appearance in the shorter wavelengths is not a reliable method. Generally, across these two hot-spot-dominated visits, we find that our models are a poor fit to the observational data. In order to fit the contamination in the longest wavelengths, the signal in the middle wavelengths is significantly overestimated, or the inverse, such that we cannot fit the full wavelength range well with our models. We are confident that this mismatch between the models and features in the longest wavelengths is not due to atmospheric features, since these features are only visible in the case of a hot-spot-dominated stellar surface. We show the third visit, along with the best-fit BT-SETTL hot-spot models for mask 1 and mask 2, and the fourth visit with the best-fit BT-SETTL hot-spot model in Figure B1, middle and right panels, respectively.
We conclude that, although our data present strong evidence for stellar contamination in the transmission spectrum of TRAPPIST-1 e, performing inferences on the transmission spectrum is not straightforward with our current stellar models for TRAPPIST-1. We are perhaps seeing the limits of our current methods, since the use of photospheric models for the modeling of magnetic active regions like spots and faculae is inherently incorrect (see, e.g., V. Witzke et al. 2022). To get to the point where we are able to correct for stellar contamination to the precision necessary to detect the ∼tens of ppm signals from terrestrial exoplanet atmospheres, we must work toward creating better stellar active region models or change our approach in tackling the problem of stellar contamination.
Appendix C: GP Atmospheric Retrieval Framework
We implemented the retrieval methodology based on Equation ( ) as follows. First, we convert our measured transit depths and errors to log space, with the errors in log space obtained through the delta method. The log likelihood is then computed independently for each visit using george’s log_likelihood function, and the per-visit values are added to form the total log likelihood. We perform our inferences using dynamic nested sampling via the dynesty library (J. S. Speagle 2020). Initially, 5000 live points are set with the multi bound and the random walk (rwalk) sampler.
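The log-space conversion and the per-visit summation can be sketched as below. This is a minimal illustration that assumes natural logarithms, for which the delta method gives a log-space error of σ/D to first order; the logarithm base is an assumption of this sketch. An independent Gaussian likelihood stands in for the GP likelihood, and all function names and numerical values are hypothetical.

```python
import numpy as np

def to_log_space(depth, depth_err):
    """Natural-log depths and first-order (delta-method) errors."""
    depth = np.asarray(depth, dtype=float)
    depth_err = np.asarray(depth_err, dtype=float)
    return np.log(depth), depth_err / depth

def gaussian_loglike(y, y_err, model):
    """Independent Gaussian log likelihood (stand-in for the GP term)."""
    resid = y - model
    return -0.5 * np.sum(resid**2 / y_err**2 + np.log(2 * np.pi * y_err**2))

def total_loglike(visits, model):
    """Sum the log likelihood over visits, each computed independently."""
    total = 0.0
    for depth, depth_err in visits:
        y, y_err = to_log_space(depth, depth_err)
        total += gaussian_loglike(y, y_err, model)
    return total

# Hypothetical depths and errors for one visit
depth = np.array([5000e-6, 5200e-6, 5100e-6])
depth_err = np.array([100e-6, 120e-6, 110e-6])
log_depth, log_depth_err = to_log_space(depth, depth_err)

visits = [(depth, depth_err), (depth, depth_err)]
model = np.log(5100e-6)
total = total_loglike(visits, model)
single = total_loglike(visits[:1], model)
```

Working in log space turns a multiplicative combination of model components into a sum, which is convenient when one of the components is a GP.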
For our GP, we use a Matérn 3/2 kernel, where instead of using the celerite (D. Foreman-Mackey et al. 2017) package approximation, we use the george implementation, which handles the exact covariance for this kernel (S. Ambikasaran et al. 2014). The hyperparameters of our GP include a visit-dependent amplitude with a uniform prior from 0 to 10 dex and a visit-dependent length scale (in microns) with a uniform prior of 0–100 μm. The jitter term per visit has a uniform prior between 0 and 1000 ppm. The offset has a uniform prior as well, from –3000 to 3000 ppm.
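To make the role of the amplitude, length scale, and jitter concrete, the following is a self-contained numpy sketch of a GP log likelihood with an exact Matérn 3/2 covariance. It does not use george, and its parameterization (amplitude squared multiplying the kernel, length scale in the same units as the wavelength) is an assumption of this sketch that may differ from the convention used by that library.

```python
import numpy as np

def matern32(x1, x2, amplitude, length_scale):
    """Matern 3/2 covariance between two sets of 1D inputs."""
    r = np.abs(x1[:, None] - x2[None, :]) / length_scale
    return amplitude**2 * (1.0 + np.sqrt(3.0) * r) * np.exp(-np.sqrt(3.0) * r)

def gp_loglike(x, y, y_err, amplitude, length_scale, jitter=0.0):
    """Zero-mean GP log likelihood with errors and jitter on the diagonal."""
    cov = matern32(x, x, amplitude, length_scale)
    cov[np.diag_indices_from(cov)] += y_err**2 + jitter**2
    chol = np.linalg.cholesky(cov)
    alpha = np.linalg.solve(chol, y)          # so alpha @ alpha = y^T K^-1 y
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))
    return -0.5 * (alpha @ alpha + log_det + len(x) * np.log(2.0 * np.pi))

# Toy residuals on a wavelength grid (values are illustrative only)
wavelength = np.linspace(0.6, 5.0, 30)       # microns
rng = np.random.default_rng(1)
y_err = np.full(wavelength.size, 0.02)
y = rng.normal(0.0, 0.02, wavelength.size)

loglike = gp_loglike(wavelength, y, y_err, amplitude=0.01, length_scale=50.0)
white = gp_loglike(wavelength, y, y_err, amplitude=0.0, length_scale=1.0)
```

A length scale much larger than the wavelength coverage makes the GP behave like a nearly constant offset, while a short one lets it follow structure across the spectrum.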
For the exoplanet atmospheric model, we draw forward models from the POSEIDON library (R. J. MacDonald & N. Madhusudhan 2017; R. J. MacDonald 2023) at a resolution of R = 10,000, which we then degrade at each step of the sampling using the POSEIDON.instrument.make_model_data function, with appropriate inputs calculated via the POSEIDON.core.init_instrument function tailored for NIRSpec/PRISM. Motivated by the work of Z. Lin et al. (2021) and J. Lustig-Yaeger et al. (2023), we consider H₂, CO₂, CH₄, H₂O, N₂, O₂, O₃, N₂O, and CO as the possible species in our modeling framework. Following B. Benneke & S. Seager (2012), we use a centered log-ratio transformation to model the mixing ratios so that any of the spectrally active molecules in our retrievals can be the background gas. At each step of the sampler, we draw the transformed abundance of each species except for H₂, whose transformed abundance we derive from the constraint that the transformed abundances sum to zero. With these, we use Equation (C1) to calculate the individual mixing ratios that are fed to POSEIDON to obtain a forward model. We set uniform priors of –22.47 to 24.17 for the transformed abundances; however, we reject any samples that give rise to mixing ratios below 10⁻¹² or above 1. We also fit for a cloud-top pressure with a log-uniform prior from 10⁻⁷ to 100 bars and a reference pressure that also has a log-uniform prior from 10⁻⁷ to 100 bars, and we use an isothermal temperature/pressure profile, with the temperature being a free parameter as well with a uniform prior between 100 and 300 K. For the star, we use a radius of 0.11697 R⊙, an effective temperature of 2559 K, an Fe/H of 0.04, and a log gravity of 5.21. For TRAPPIST-1 e, we use a radius of 0.917985 R⊕, a mass of 0.6356 M⊕, and an equilibrium temperature of 255 K. Thus, for retrievals incorporating GPs and exoplanet atmospheric models, the total number of free parameters is 27.
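The centered log-ratio (CLR) parameterization can be sketched as follows. In the standard form, each transformed abundance is the log of the mixing ratio minus the mean log mixing ratio, so the transformed values sum to zero and the inverse is a normalized exponential. The function names, the four-species toy composition, and the choice to place the derived species last are assumptions of this sketch.

```python
import numpy as np

def clr(mixing_ratios):
    """Centered log-ratio transform of a composition that sums to 1."""
    log_x = np.log(np.asarray(mixing_ratios, dtype=float))
    return log_x - log_x.mean()

def inverse_clr(xi):
    """Recover mixing ratios from CLR values (normalized exponential)."""
    xi = np.asarray(xi, dtype=float)
    w = np.exp(xi - xi.max())   # subtract the max for numerical stability
    return w / w.sum()

def mixing_ratios_from_free(xi_free, floor=1e-12):
    """Build the full composition from the sampled CLR values.

    The last species is not sampled: its CLR value follows from the
    sum-to-zero constraint. Returns None when the sample is rejected.
    """
    xi = np.append(xi_free, -np.sum(xi_free))
    x = inverse_clr(xi)
    if np.any(x < floor) or np.any(x > 1.0):
        return None
    return x

# Hypothetical four-species composition, for illustration only
x_true = np.array([0.7, 0.2, 0.09, 0.01])
xi = clr(x_true)
x_back = mixing_ratios_from_free(xi[:-1])
rejected = mixing_ratios_from_free(np.array([20.0, 20.0, 20.0]))
```

Because no species is singled out in the transform, the prior treats every gas symmetrically, which is what allows any of them to act as the background gas.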
For the stellar contamination modeling, we set uniform priors on the temperature of cold “spots” from 1500 to 2450 K and hot “spots” from 2750 to 5000 K. Spot covering fractions also have uniform priors between 0 and 1, and we reject samples that lead to sums of hot and cold covering fractions larger than 1. We use BT-SETTL stellar models to model both spots and the stellar photosphere for TRAPPIST-1 (F. Allard 2014). We use the same stellar parameters for TRAPPIST-1 as in Appendix
For the flat-line retrievals presented in Section , we set both the atmospheric model and the stellar contamination model to zero. We set a uniform prior on the offset spanning ±3000 ppm around a central value of 5176.8 ppm (which is consistent with the planet-to-star radius ratio used to initialize POSEIDON, as discussed above).
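The central value of the offset prior can be cross-checked against the adopted radii. The sketch below assumes the planetary and stellar radii quoted for the POSEIDON setup are expressed in nominal Earth and solar radii, and uses the IAU nominal conversion values; the variable names are hypothetical.

```python
R_SUN_KM = 695700.0    # IAU nominal solar radius
R_EARTH_KM = 6378.1    # IAU nominal equatorial Earth radius

stellar_radius_km = 0.11697 * R_SUN_KM
planet_radius_km = 0.917985 * R_EARTH_KM

radius_ratio = planet_radius_km / stellar_radius_km
depth_ppm = radius_ratio**2 * 1e6

# Uniform prior on the offset, centered on the expected depth
prior_low, prior_high = depth_ppm - 3000.0, depth_ppm + 3000.0
```

The squared radius ratio comes out at about 5177 ppm, in line with the central value quoted for the prior.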
C.1. JWST NIRSpec/PRISM Retrieval Posteriors
In Figure C1, we show a subset of our posterior distributions for the JWST NIRSpec/PRISM atmospheric retrievals following the abovementioned framework, which offers several insights. First, note how the atmospheric parameters (cloud-top pressure or surface pressure along with H₂ mixing ratio) do not show a strong correlation with the GP hyperparameters (length scales and amplitudes). Second, note how the GP length scales get progressively smaller for the last two visits, where we see the most prominent trends in the transmission spectrum when it comes to stellar contamination. For visits 1 and 2, the length scales are large, so the GP is mostly a nearly flat line, which is exactly what is observed in Figure . For visit 3 and particularly for visit 4, the length scale is smaller, which is reflected in the behavior of the GP in the same figure.
Figure C1. Atmospheric and GP hyperparameter posterior distributions. Posterior distribution of some of our JWST NIRSpec/PRISM retrieval parameters, compared to the atmospheric parameters we constrain in this work in Figure , H₂ and cloud-top pressure/surface pressure. Note how the GP hyperparameters (length scales and amplitudes) do not show strong correlations with the atmospheric parameters.
C.2. HST/WFC3 Retrievals
Finally, for the HST/WFC3 retrievals whose posterior distributions are compared to the JWST ones in Section using the data in Z. Zhang et al. (2018), we use the very same methodology and priors outlined above for the JWST retrievals, although this only incorporates data for the two visits analyzed in that work. The retrievals also use models generated at a resolution of R = 10,000, which are binned to the resolution of the HST/WFC3 G141 grism. In Figure C2, we present an analog of Figure (presented in Section for our JWST results) for the retrievals performed on these HST/WFC3 data.
Figure C2. The TRAPPIST-1 e HST/WFC3 transmission spectra presented in Z. Zhang et al. (2018) interpreted with GPs and an atmospheric model. (Left) Transmission spectra of the two HST/WFC3 visits (black points with error bars) modeled with a GP times an atmospheric model (gray), analogous to the top panels in Figure . Bands represent 1σ and 3σ credibility. (Right) Visit-combined transmission spectrum obtained by averaging the two WFC3 visits after correcting for the modeled GP component (black points with error bars). The atmospheric model is in gray. Bands represent 1σ and 3σ credibility. This is analogous to the bottom panel of Figure . Note that we use the very same y-limits in this figure and in Figure