#introductions Hi, I'm an astronomer and data scientist at the University of Washington. I work on statistical methods for astronomy, including black holes, asteroids and the sun. I also play harp and take pictures of things (mostly waterfalls and squirrels).
@Ricardus But they're so cute!
@tiana_athriel Hi welcome!
@bgcarlisle Hi! Thank you! :)
@tiana_athriel Your research sounds amazing!
@tiana_athriel You sound pretty awesome!
Hello Tiana!
Welcome to Mastodon
@tiana_athriel hello! welcome!
@tiana_athriel Hello and welcome!
Question: Do you have a favorite scifi book or film that didn't insult your astrological knowledge? 
@tiana_athriel in all seriousness what does that statistical model look like? Are you looking at the a matched filter then using correlation/convolution to look for repeated signals?
@fullywoolly Right now I work a lot with Gaussian Processes. They're magic! :) I have a lot of non-stationary, unevenly sampled time series that may or may not have a periodic signal. Since I don't have *that* many data points, Gaussian Processes are perfect for that task!
@tiana_athriel they do sound magic! Everything I've had to process in an academic setting has either a well-defined system or signal. It is hard to imagine that you can get much of anything without having either of those things! Space is pretty good medium for transmitting light, but over such vast distances the constructive and destructive interference must be brutal.
Important questions, python or Matlab/Octave?
@fullywoolly Not really that much interference, mostly just gas and dust in the way that messes things up. Astronomy data is pretty clean compared to other fields, and the problems are often well-bounded (compared to, say, ecology). I do everything in Python (yay open source!), but also know people who write R, C++ and, of course, the physicists' favourite: Fortran. Matlab/Octave is much rarer.
@tiana_athriel numpy and the recent updates to matplotlib make for a nice processing and output combo.
I thought this was a really great series on how to pick the best color scheme for the data and what you are trying to convey.
https://earthobservatory.nasa.gov/blogs/elegantfigures/2013/08/05/subtleties-of-color-part-1-of-6/
The attached picture was from one of my last classes. Doppler on the x-axis and range on the y. All the stuff clumped in the middle at the top are mountains, an airplane, and then way out (bottom left) is the Aurora. Pretty cool!
@tiana_athriel
R could be nice for the built-in stats tools, but does it really do much more than what has already been written for python? I'd rather have a general purpose language that is good as a workhorse too instead of something so specialized.
I'm sure C++ is nice, especially after you have all your tools written, but that sounds brutal. I'm guessing Fortran is similar (never used it).
I guess Matlab and Octave are used mmore in the engineering realm.
@fullywoolly A lot of statisticians write R, so if you're interested in the newest and shiniest of statistical models, sometimes that's where you have to go.
And sometimes you just have algorithms that are too slow to compute in pure python, even with numpy and scipy.
I'm currently learning Vega/Vega-Lite and the python package Altair for #visualization, which I find much more sensible and fun than matplotlib (though I still do a *lot* of visualizing in that, too)
@tiana_athriel makes sense about the stats people using R.
numpy is already compiled/optimized right? I can understand if you are I/O bound like thrashing the hard disk by reading/writing too frequently or saturating the bus going to memory. That is where languages like C or C++ will be stronger.
Vega looks nice. Would've helped with my last project at work. I haven't tried using tools like that or Pandas. Even though my dataset was coming from a db. I just didn't think about it.
@tiana_athriel squirrel bad!