显示标签为“science_general”的博文。显示所有博文
显示标签为“science_general”的博文。显示所有博文

2013年2月11日星期一

rOpenSci is a collaborative effort to develop R-based tools for facilitating Open Science

At rOpenSci we are creating packages that allow access to data repositories through the R statistical programming environment that is already a familiar part of the workflow of many scientists. We hope that our tools will not only facilitate drawing data into an environment where it can readily be manipulated, but also one in which those analyses and methods can be easily shared, replicated, and extended by other researchers. While all the pieces for connecting researchers with these data sources exist as disparate entities, our efforts will provide a unified framework that will be quickly connect researchers to open data.

http://ropensci.org/

rentrez - search or download data from various NCBI databases.


rentrez provides functions that work with the NCBI eutils to search or download data from various NCBI databases.
The package hasn't been thoroughly tested yet, but the functions for each of the Eutils functions are implimented. If you try the package and find bugs please let me know.

2012年12月7日星期五

The Best Scientific Figures of 2012

http://www.wired.com/wiredscience/2012/12/science-figures-2012/?pid=5582&viewall=true


Humanity's Recent Evolution

It's easy to think that modern life has slowed human evolution, but the opposite is true. Most of humanity's genetic variation has accumulated in the last few thousand years during a period of extraordinary population growth. As a species, we're more evolvable than ever.
In the figure above, taken from an in-depth genetic analysis of 6,515 people, the amount of genetic variation collectively present at each location in the human genome is tabulated from before (left) and after (right) the population boom.
Citation: “Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants.” By Wenqing Fu, Timothy D. O’Connor, Goo Jun, Hyun Min Kang, Goncalo Abecasis, Suzanne M. Leal, Stacey Gabriel, David Altshuler, Jay Shendure, Deborah A. Nickerson, Michael J. Bamshad, NHLBI Exome Sequencing Project & Joshua M. Akey. Vol. 491, No. 7426, 29 November 2012

2012年11月5日星期一

Install R and Rstudio in Ubuntu


Install R in Ubuntu is extremely easy if you don’t meet any exception, but if you meet, then you’d better be a very advanced linux user :-)
Install R
Because the Ubuntu official source R version is usually half of years older than R-project official source, so it is recommanded to using r-project.org official source to install the latest R system.
vi /etc/apt/sources.list
1
2
3
# append below line to end of sources.list
# you can view mirror at http://cran.r-project.org/mirrors.html
deb http://ftp.ctex.org/mirrors/CRAN/bin/linux/ubuntu precise/
import the GPG key and install r-base
1
2
3
4
5
cd ~
gpg --keyserver hkp://keyserver.ubuntu.com:80 --recv-keys E084DAB9
gpg -a --export E084DAB9 | sudo apt-key add -
apt-get upgrade
apt-get install r-base
Install Oracle DB access package
You can found new version of ROracle or DBI package in CRAN, it is also required you properly install the Oracle Instant Client.
manual install the ROracle
1
2
3
4
wget http://cran.r-project.org/src/contrib/DBI_0.2-5.tar.gz
R CMD INSTALL DBI_0.2-5.tar.gz
wget http://cran.r-project.org/src/contrib/ROracle_1.1-5.tar.gz
R CMD INSTALL --configure-args='--with-oci-inc=/opt/oracle/instantclient_11_2/sdk/include' ROracle_1.1-5.tar.gz
Install RStudio Server
1
2
3
4
5
apt-get install libssl0.9.8 # must install even you have newer version
apt-get install libapparmor1 apparmor-utils
wget http://download2.rstudio.org/rstudio-server-0.96.331-i386.deb
dpkg -i rstudio-server-0.96.331-i386.deb
rstudio-server verify-installation
Do some RStudio Server setting
1
2
3
4
5
6
echo 'rsession-memory-limit-mb=1000' > /etc/rstudio/rserver.conf
echo 'rsession-stack-limit-mb=4' >> /etc/rstudio/rserver.conf
echo 'rsession-process-limit=20' >> /etc/rstudio/rserver.conf
# Only pass below if you will using proxy mode
echo 'www-address=127.0.0.1' >> /etc/rstudio/rserver.conf
groupadd rstudio
Setting the proxy server for RStudio server
This section is optional, assured already install nginx in server.
do not forgot link to /opt/nginx/conf/vhosts
1
2
3
4
5
6
7
8
server {
  listen       80;
  server_name  cvprstudio;
  location / {
    proxy_pass http://localhost:8787;
    proxy_redirect http://localhost:8787/ $scheme://$host/;
  }
}
Setting auto restart and PATH
1
2
ln -s /usr/lib/rstudio-server/extras/init.d/debian/rstudio-server /etc/init.d/rstudio-server
vi /etc/init.d/rstudio-server
append below line to /etc/init.d/rstudio-server SCRIPTNAME
1
2
3
4
ORACLE_BASE=/opt/oracle
ORACLE_HOME=/opt/oracle/instantclient_11_2
TNS_ADMIN=/opt/oracle/network/admin
NLS_LANG=AMERICAN_AMERICA.AL32UTF8
Now you can restart/start via standard init.d service way
1
/etc/init.d/rstudio-server restart
Add a user in RStudio
1
2
adduser --ingroup rstudio cindy
passwd cindy # setting password
Update package
Usually it is more good to upgrade the r-base in system wide packages instead of per user

2012年3月16日星期五

really good ideas and guides on fast publication

go there to see the details, http://yihui.name/en/2012/03/a-really-fast-statistics-journal/

Followings are my comments:


Dear Yihui, Very Excellent Ideas and applicable tech guides. I agree well with you, though I am not working in pure stat area, but in Biology. The same issues you pointed out here, occurred also for Biology and for all other academic publishing areas (I guess). We definitely need some improvements in formatting, data/code sharing, and also peer review process.

In BioMed area, f1000research (http://f1000research.com/) has made the first step. "F1000 Research will offer immediate publication; open, post-publication peer review; open revisioning of work including ongoing updates;and encourage raw data deposition and publication. In addition, F1000Research will accept a broad range of article formats and content types." I think F1000research, by its post-publication peer-review rule, would be an ideal one fit your expectation (fast publication). Do you want to express any of your ideas on post-publication peer-review? I have not found them in this blog, though I have felt your passions on that.

2012年2月22日星期三

匹夫無罪,懷璧其罪

百姓本沒有罪,因身藏璧玉而獲罪。原指財寶能致禍。後亦比喻有才能、有理想而受害。


    “匹夫无罪,怀璧其罪”出自《春秋左传-桓公十年》,原文如下:“初,虞叔有玉,虞公求旃。弗献。既而悔之,曰:‘周谚有之:匹夫无罪,怀璧其罪。吾焉用此,其以贾害也?’乃献之。又求其宝剑。叔曰:‘是无厌也。无厌,将及我。’遂伐虞公。故虞公出奔共池。”

Air Pollution Linked to Heart and Brain Risks

研究表明:空气污染可能导致心脏和大脑疾病

2012年2月13日星期一

Mathematics in Biology


Mathematics in Biology


这是 Science 的生物数学专辑。8年前曾经给我触动,今天又偶然遇到它。

2012年2月6日星期一

why we study phenotypic variations

the proteome is closer to the phenotype than the genome or the transcriptomeand as such may be more directly responsive to natural selectionand thus closely linked to adaptation


I just cite a sequence here.

2012年2月1日星期三

2012年1月29日星期日

I do not agree with him


Cultural history holds back Chinese research


We can not freely choose culture for ourself in China now. Everything has been controlled, especially those of culture. So how do you conclude as this, Peng Gong, a so-called intellectual in China? It is not the right time to think if cultural hold back research. It is the time for us to think how/at what level the communism politic depleted our freedom of ....

Do you know what happened in the period of the Cultural Evolution? We have already no culture for at least tens of years. 

2012年1月28日星期六

2012年1月20日星期五

The "impact factor" revisited - 影响因子的前生后世

你知道什么是杂志的影响因子吗?它的目的是什么?它和什么没关系?请看影响因子的前生后世。

2012年1月19日星期四

a logical progression for scientific research 做科研的逻辑

the logical progression:


(1) identifying the research question,
(2) characterizing the data
(3) formulating the model 
(4) drawing valid conclusions