You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
To get setup for development, see [this video if you prefer VS Code](https://youtu.be/zddl3n1DCFM), or [this older video if you prefer PyCharm](https://youtu.be/QniQi-Hoo9A), and the instructions below.
## 7. Google Cloud authentication (for legacy tests)
49
+
50
+
To run legacy tests which read data from GCS, you'll need to [request access to MalariaGEN data on GCS](https://malariagen.github.io/vector-data/vobs/vobs-data-access.html).
51
+
52
+
Once access has been granted, [install the Google Cloud CLI](https://cloud.google.com/sdk/docs/install):
## 6. Google Cloud authentication (for legacy tests)
51
+
52
+
To run legacy tests which read data from GCS, you'll need to [request access to MalariaGEN data on GCS](https://malariagen.github.io/vector-data/vobs/vobs-data-access.html).
53
+
54
+
Once access has been granted, install the Google Cloud CLI:
55
+
```bash
56
+
brew install google-cloud-sdk
57
+
```
58
+
59
+
Then authenticate:
60
+
```bash
61
+
gcloud auth application-default login
62
+
```
63
+
64
+
This opens a browser — log in with any Google account.
65
+
66
+
Once authenticated, run legacy tests:
67
+
```bash
68
+
pytest --ignore=tests/anoph -v tests
69
+
```
70
+
71
+
Tests will run slowly the first time, as data will be read from GCS and cached locally in the `gcs_cache` folder.
72
+
73
+
## 7. VS Code terminal integration
74
+
75
+
To use the `code` command from the terminal:
76
+
77
+
Open VS Code → `Cmd + Shift + P` → type `Shell Command: Install 'code' command in PATH` → press Enter.
Copy file name to clipboardExpand all lines: README.md
+4-1Lines changed: 4 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -44,8 +44,11 @@ for release notes.
44
44
45
45
## Developer setup
46
46
47
-
To get setup for development, see [this video if you prefer VS Code](https://youtu.be/zddl3n1DCFM), or [this older video if you prefer PyCharm](https://youtu.be/QniQi-Hoo9A), and the instructions below.
47
+
To get setup for development, see [this video if you prefer VS Code](https://youtu.be/zddl3n1DCFM), or [this older video if you prefer PyCharm](https://youtu.be/QniQi-Hoo9A).
48
48
49
+
For detailed setup instructions, see:
50
+
-[Linux setup guide](LINUX_SETUP.md)
51
+
-[macOS setup guide](MACOS_SETUP.md)
49
52
Detailed instructions can be found in the [Contributors guide](https://github.com/malariagen/malariagen-data-python/blob/master/CONTRIBUTING.md).
Copy file name to clipboardExpand all lines: malariagen_data/anoph/snp_frq.py
+5-5Lines changed: 5 additions & 5 deletions
Original file line number
Diff line number
Diff line change
@@ -125,15 +125,15 @@ def snp_effects(
125
125
A dataframe of SNP allele frequencies, one row per variant allele. The variant alleles are indexed by
126
126
their contig, their position, the reference allele, the alternate allele and the associated amino acid change.
127
127
The columns are split into three categories: there is one column for each taxon filter (e.g., pass_funestus, pass_gamb_colu, ...) containing whether the site of the variant allele passes the filter;
128
-
there is then 1 column for each cohort containing the frequency of the variant allele within the cohort, additionally there is a column `max_af` containing the maximum allele frequency of the variant allele accross all cohorts;
128
+
there is then 1 column for each cohort containing the frequency of the variant allele within the cohort, additionally there is a column `max_af` containing the maximum allele frequency of the variant allele across all cohorts;
129
129
finally, there are 9 columns describing the variant allele: `transcript` contains the gene transcript used for this analysis,
130
130
`effect` is the effect of the allele change,
131
131
`impact`is the impact of the allele change,
132
132
`ref_codon` is the reference codon,
133
133
`alt_codon` is the altered codon with the variant allele,
134
134
`aa_pos` is the position of the amino acid,
135
135
`ref_aa` is the reference amino acid,
136
-
`alt_aa` is the altered amino acid with the varaint allele,
136
+
`alt_aa` is the altered amino acid with the variant allele,
137
137
and `label` is the label of the variant allele.
138
138
""",
139
139
notes="""
@@ -296,15 +296,15 @@ def snp_allele_frequencies(
296
296
returns="""
297
297
A dataframe of amino acid allele frequencies, one row per variant. The variants are indexed by
298
298
their amino acid change, their contig, their position.
299
-
The columns are split into two categories: there is 1 column for each cohort containing the frequency of the amino acid change within the cohort, additionally there is a column `max_af` containing the maximum frequency of the amino acide change accross all cohorts;
299
+
The columns are split into two categories: there is 1 column for each cohort containing the frequency of the amino acid change within the cohort, additionally there is a column `max_af` containing the maximum frequency of the amino acid change across all cohorts;
300
300
finally, there are 9 columns describing the variant allele: `transcript` contains the gene transcript used for this analysis,
301
301
`effect` is the effect of the allele change,
302
302
`impact`is the impact of the allele change,
303
-
`ref_allele` is the reference allel,
303
+
`ref_allele` is the reference allele,
304
304
`alt_allele` is the alternate allele,
305
305
`aa_pos` is the position of the amino acid,
306
306
`ref_aa` is the reference amino acid,
307
-
`alt_aa` is the altered amino acid with the varaint allele,
307
+
`alt_aa` is the altered amino acid with the variant allele,
0 commit comments