CUHK Research Data Repository (Chinese University of Hong Kong)
Not a member yet
132 research outputs found
Sort by
Text Analysis and Visualisation of The Observatory Review from Hong Kong Early Tabloid Newspaper (香港早期小報《天文臺》的文本分析與可視化 )
Text Analysis and Visualisation of The Observatory Review from Hong Kong Early Tabloid Newspaper
The "Hong Kong Early Tabloid Newspapers"《香港早期小報》, launched in 2022, is a collection of early tabloid newspapers digitized for public access. The collection covers over 80 titles in over 5000 issues of tabloid newspapers published in Hong Kong from 1914 to 1993. In contrast to the major newspapers which focus on reporting "serious" public affairs, tabloid newspapers pay attention to minor topics of interests and entertainments. It serves as an alternative platform for a variety of issues like political secrets, dramas, pornography, and leisure etc. Apart from the above, thinner in size and cheaper in price made tabloid newspapers more accessible and attractive for the public. Tabloid newspapers provides an important insight to the multifaceted society and cultural circulation at the time.
Project goal
We happen to have, as a by-product of the Tabloid Newspaper digitization process, the newspapers' text extracted with Optical Character Recognition (OCR) service. Hence, we started this spin-off pilot project, as an experiment, aiming to salvage valuable information from the data that's sitting around. We also tried to explore in elementary analysis and visualization to improve accessibility for the public. Hopefully, through this project, we might be able to apply similar model to other collections to extract meaningful, important history pieces for future display and study
Global burden, risk factors, and trends of esophageal cancer: An analysis of cancer registries from 48 countries
This data archive contains the data that supplement the manuscript in Cancers titled “Global burden, risk factors, and trends of esophageal cancer: An analysis of cancer registries from 48 countries”. The data consist of data source of the analysis of esophageal cancer, incidence and mortality by region and histological subtype, incidence and mortality trends of esophageal cancer, trend analysis by country, joinpoint regression analysis, and the AAPC of the incidence of esophageal cancer in individuals aged <50 years
Detention population demographics at Tai Tam Gap Correctional Institution (TTGCI or TTG)
The RGC-funded project 'Immigration Detention and Vulnerable Migrants in Hong Kong' ran from 2020 - 2023. The project used socio-legal and mixed methods, including collecting data from official platforms and via the Code on Access to Information.
Source / Credit: All data in this dataset is official data from the HKSAR Government.
As of 30 Aug 2023, ready-for-use Excel/CSV files of the uploaded datasets were not already available on data.gov.hk or Immigration Department platforms. Thus they have been cleaned and made available here. For updated information, please refer to official sources or file an access to information request via the Code of Access to Information.</p
Genome assembly of the rare and endangered Grantham’s camellia, Camellia granthamiana
Grantham’s camellia (Camellia granthamiana Sealy) is a rare and endangered tea species discovered in Hong Kong in 1955 and endemic to southern China. Despite its high conservation value, the genomic resources of C. granthamiana are limited. Here, we present a chromosome-scale draft genome of the tetraploid C. granthamiana (2n = 4x = 60), combining PacBio long-read sequencing and Omni-C data. The assembled genome size is ∼2.4 Gb, with most sequences anchored to 15 pseudochromosomes resembling a monoploid genome. The genome has high contiguity, with a scaffold N50 of 139.7 Mb, and high completeness (97.8% BUSCO score). Our gene model prediction resulted in 68,032 protein-coding genes (BUSCO score of 90.9%). We annotated 1.65 Gb of repeat content (68.48% of the genome). Our Grantham’s camellia genome assembly is a valuable resource for investigating Grantham’s camellia’s biology, ecology, and phylogenomic relationships with other Camellia species, and provides a foundation for further conservation measures
Learning-based density-equalizing map
The code and data for the paper "Learning-based density-equalizing map
9. Detection of SNP and InDels
Detection of Single Nucleotide Polymorphism (SNP) and Insertion-Deletions (InDels)
Does source similarity type matter in online review effectiveness? The moderating role of information processing style
Does source similarity type matter in online review effectiveness? The moderating role of information processing styl
Does theory of planned behaviour play a role in predicting uptake of colorectal cancer screening? A cross-sectional study in Hong Kong
This data archive contains the data that supplement the manuscript in BMJ Open titled “Does theory of planned behaviour play a role in predicting uptake of colorectal cancer screening? A cross-sectional study in Hong Kong”. The data consist of the participation rates of colorectal cancer screening by faecal immunochemical test in selected regions, a questionnaire on the constructs of Theory of Planned Behaviour model towards colorectal cancer screening, and a flow chart on participants recruitment
1. Data extracted from eligible meta-analysis papers
This is an Excel dataset containing the data (texts and numbers) extracted from 333 eligible meta-analysis papers