File size: 4,551 Bytes
3c831e9
9e9a167
 
 
3c831e9
 
9e9a167
 
 
 
3c831e9
 
075fb2f
 
 
9e9a167
075fb2f
37a882d
075fb2f
 
 
37a882d
075fb2f
 
 
9e9a167
075fb2f
db91162
075fb2f
9e9a167
075fb2f
 
 
9e9a167
075fb2f
 
 
 
 
9e9a167
075fb2f
9e9a167
 
 
075fb2f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9e9a167
075fb2f
db91162
075fb2f
fbface2
075fb2f
 
 
9e9a167
075fb2f
fbface2
075fb2f
db91162
075fb2f
6ab5ce1
075fb2f
 
 
6ab5ce1
075fb2f
 
 
 
 
 
9e9a167
075fb2f
6ab5ce1
075fb2f
6ab5ce1
075fb2f
9e9a167
075fb2f
 
 
 
 
 
db91162
075fb2f
5e310c2
075fb2f
9e9a167
075fb2f
41ee37d
075fb2f
 
 
 
 
 
 
 
9e9a167
 
 
075fb2f
 
 
9e9a167
075fb2f
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
---
title: Maya4
emoji: 馃搳
colorFrom: indigo
colorTo: indigo
sdk: static
pinned: true
license: apache-2.0
thumbnail: >-
  https://cdn-uploads.huggingface.co/production/uploads/68a43a86eea45496edb28ba6/hJyITwdmrofMtEtX8qFmW.png
---

<p align="center">
  <img src="Maya4.png" alt="Maya4 Logo" width="780">
</p>

<h1 align="center">Maya4</h1>

<p align="center">
  Multi-level intermediate SAR representations from Sentinel-1 Stripmap acquisitions
</p>

<p align="center">
  <strong>Level-0 to Level-1</strong><strong>Zarr-native</strong><strong>Cloud-accessible</strong><strong>2 TB</strong>
</p>

---

## Overview

**Maya4** is a curated SAR data resource designed to expose the full progression of **Sentinel-1 Stripmap acquisitions** from **raw radar echoes** to **fully focused Level-1 imagery**.

Unlike conventional datasets that provide only final products, Maya4 preserves and organizes the **intermediate signal representations** generated across the SAR processing chain. This makes the dataset particularly suitable for:

- SAR signal processing research
- physics-aware machine learning
- self-supervised pre-training
- representation learning across processing levels
- algorithm benchmarking and reconstruction studies

The name *Maya4* draws from the concept of **M膩y膩**: the idea that reality is revealed through successive layers. In the same way, SAR imagery emerges through a sequence of transformations from raw measurements to interpretable image products.

---

## Why Maya4

<table>
  <tr>
    <td valign="top" width="33%">
      <h3>Multi-level access</h3>
      <p>Provides consistent access to multiple SAR processing stages rather than only the final image product.</p>
    </td>
    <td valign="top" width="33%">
      <h3>Research-oriented structure</h3>
      <p>Designed for analysis of information flow, model pre-training, and development of custom SAR pipelines.</p>
    </td>
    <td valign="top" width="33%">
      <h3>Cloud-native delivery</h3>
      <p>Distributed in <strong>Zarr</strong> format for scalable storage, streaming, and computation.</p>
    </td>
  </tr>
</table>

---

## Dataset Access

| Dataset | Access | Mission / Mode | Format | Size |
|---------|--------|----------------|--------|------|
| **Maya4** | [Open bucket](https://huggingface.co/buckets/ESA-philab/Maya4) | Sentinel-1 Stripmap | Zarr | 2 TB |

---

## Processing Chain

A defining feature of Maya4 is its **sharded multi-level organization**, which preserves the major intermediate states of the SAR focusing pipeline.

<p align="center">
  <img src="https://i.ibb.co/Wv7SXd4N/intermediates.jpg" alt="Maya4 intermediate SAR representations" width="100%">
</p>

| Processing Level | Abbrev. | Description | Technical Value |
|------------------|---------|-------------|-----------------|
| **Raw** | `raw` | Unprocessed radar echoes as acquired by Sentinel-1 | Enables custom end-to-end SAR processing and low-level signal analysis |
| **Range Compressed** | `rc` | Echoes compressed in the range dimension using matched filtering | Improves signal-to-noise ratio and resolves scatterers along range |
| **Range Cell Migration Corrected** | `rcmc` | Echoes after compensation of range migration effects | Preserves geometric consistency and prepares the signal for azimuth focusing |
| **Azimuth Compressed** | `ac` | Fully focused SAR image in slant-range geometry | Corresponds to the interpretable focused SAR image product |

---

## Technical Positioning

Maya4 is intended to support work at the intersection of:

- SAR signal processing
- remote sensing foundation models
- self-supervised and masked modeling approaches
- physics-guided representation learning
- inverse problems and reconstruction
- benchmarking of processing-aware architectures

Because the dataset exposes multiple internal stages of SAR formation, it enables experiments that are not possible with image-only repositories.

---

## Key Characteristics

| Attribute | Value |
|-----------|-------|
| **Mission** | Copernicus Sentinel-1 |
| **Acquisition Mode** | Stripmap |
| **Processing Coverage** | Level-0 to Level-1 intermediates |
| **Primary Distribution Format** | Zarr |
| **Access Paradigm** | Cloud-native bucket access |
| **Primary Target Users** | SAR researchers, ML practitioners, remote sensing scientists |

---

## Acknowledgements

Maya4 is based on data from the **Copernicus Sentinel-1 mission** of the **European Space Agency (ESA)**.

Dataset curation and organization are maintained by the **Maya4 organization**.