Localization of Languages for eGovernance and RTIA

eGovINDIA, INDIA RTI, INDIA WBA,

TDIL-MCIT,GoI – Technology Development for Indian Languages, Government of India

TDIL-MCIT,GoI Technology Development for Indian Languages, Government of India

 

http://tdil.mit.gov.in/news.htm

 

  Omvikas@mit.gov.in

http://tdil.mit.gov.in/apr_2005.htm

 

Transiting from catch-up to competitive phase ICT
is the enabling technology for productivity
enhancement and socioeconomic development.
There is sprawling digital divide between nations
basically due to non-affordability of IT appliances
and non-availability of requisite computing facility �
fonts and software tools � in local languages.
Conscious efforts are being made through UNESCO
in this direction. Certain nations have also initiated
their own programmes to develop information
processing tools and applications in their languages.
India has taken a major initiative towards ensuring
ICT in local languages for rapid socio-economic
development and participation of people at large.
With this backdrop, the national mission on
�Technology Development for Indian Languages�
promoted development of basic information
processing tools and linguistic resources for Indian
languages through academic and government
supported R&D institutions spread over the country.
There are 22 officially recognized languages and 11
scripts in vogue in India. One script may be used for
more than one language. Hence, India is a unique
country with multilingual, multi-script environment.
Although with her talented manpower, India has
emerged as pioneer in Information Technology, with
the ability, to handle turnkey projects from abroad at
lower cost, higher accuracy and at faster pace.
However, purchasing power as compared to
developed nations such as USA is 1/15. In India
illiteracy persist at about 34.6% and literacy in
English is less than 5%. This necessitates support
for R&D in the focused areas to develop basic
information processing technologies and
consolidating and packaging them in user friendly
form and releasing them to the masses for free-use.
This will greatly promote people�s participation in
the process of using this technology and developing
innovative products and services.

 

Besides, the activities towards preparing for knowledge based
society would be through the projects such as cross
lingual information access, multilingual search,
voice-enabled query system, etc. Report on the
release of fonts and basic information processing
tools for Tamil and Hindi entails list of various tools
and linguistic resources being available for free use.
Similar releases will follow for other languages.
ELITEX-2005 showcased the multilingual products
and services under the theme �Grass root
applications using ICT�.

There is lot more in the open domain which needs to
be searched and integrated appropriately into user-friendly
tools/ products for free use in the multilingual environment.
UNESCO downloads covers the free downloadable software
on UNESCO website under different categories such as
Operating System, Digital Library, Development Tools,
Productivity Tools, Science, etc. Identification of such tools
and integrating them into user friendly products and services
for mass use in developing countries is desirable. Prof. R.K. Joshi,
highlights the issues about Font Designing and Font Technology
in the context of complex Indian scripts in great detail in the article
�Indian Language: Font Designing and Font Technology�. The
article on �Smart Fonts� discusses various issues related
to smart font such as what is Smart Fonts, what are different
smart font formats, what is Graphite, why was Graphite developed,
how is Graphite different from OpenType and other complex
script technologies? �LaTex� tool is a powerful macro-based
text formatter written by Donald Knuth, and is popular in the
scientific community. LaTeX is the de facto standard for the
communication and publication of scientific documents. This
is in open domain and can be used for multilingual publication.
The article on �Digital Library of India� describes various
activities like digitization of books, manuscripts, magazines,
journals, etc., and development of Digital Library tools like
Cross-lingual Information Access, Multilingual Crawler, Multimodal
Interface for physically challenged, Automatic search Indexing
tools, Multimodal authoring tools, Text summarization, Folk
songs search and retrieval techniques undertaken at different
centres like CDAC Kolkata, IGNCA Delhi, IIIT Hyderabad,
IIIT Allahabad is covered. Greenstone: a suite of software for
building and distributing fully-searchable, metadata-driven digital
library, digital library collections, and features of ABBYY FineReader 7.0
Professional Edition, optical character recognition (OCR) system,
with high-recognition accuracy and format retention facility are
also described. These basic technologies will enhance IT applications
in local languages and facilitate the country transit from catch-up
phase into competitive phase.

October 25, 2006 Posted by localization | TDIL | | No Comments Yet

Localization of Languages is a MUST for EFFECTIVE Implementation of eGovernance, RTI Act and all other Acts of INDIA.

Main objective of use of Information and Communication Technology in e-Government activities is to bring the information and services more effectively to the citizens with the final objective of improving their standard of living. In order to provide information that could be useful to the citizens, information should be provided in languages that could be best understood by them. Therefore use of local languages in all the e-Government projects is fundamental for those projects to be successful.

How many OFFICIAL Languages we have in INDIA ?

There are 18 to 20 official language.

 

For effective implementation of eGovernance of INDIA,  RTI Act 2005 and other Acts, INDIA need to promote LOCALIZATION of LANGUAGES if it wants to be properly implemented in INDIA.

 

Only 5 % in INDIA speak English. Only 30 to 40 % in INDIA knows Hindi.

85 % to 90 % of people speak there own STATE LANGAUGE. We need to CATER to these people. This should happen in INDIA. GOI needs to think about this for it’sfuture. Real RTI Actvists who are in the field knows about this probelms.

 

 

Right to Info Act hits language barrier

http://dnaindia.com/report.asp?NewsID=9593

 

Minister Dayanidhi Maran Says Only FIVE percent speak ENGLISH in INDIA. SO 95% of population needs to know RTI rules and regulations in different Languages of INDIA.

 

‘Connectivity, computers, content rule broadband’

PRESS TRUST OF INDIA

Posted online: Thursday, November 10, 2005 at 1837 hours IST
http://www.financialexpress.com/latest_full_story.php?content_id=108240

 

“If we are to boost the broadband penetration in the country we need content in English as well as in all other regional languages. After all only five per cent people speak English,” Dayanidhi Maran said.

 

 

According to Information Technology Secretary Brijesh Kumar, “In order to push PC penetration beyond the urbane limits, there is a strong need to make the applications and content available to users in their native language. From December, users will have an option to use the computer not just in English but also in Hindi, Tamil and Telugu, especially since the software will come free of cost.”

 

http://www.hindustantimes.com/news/181_1546385,0002.htm

 

According to Kumar, the key drivers that will drive exponential growth for this market will be the newer areas of application for local language IT, broad based e-governance initiatives that will employ local language as a front end to disseminate government services to citizens. “Bundling of multi-lingual software with PCs and other access devices will prove to be a catalyst in promoting PC usage in the country as it did in countries like China,” he added.

 

Zenith Computers CEO Raj Saraf, “Given that only three per cent of the Indian population can speak in English while close to 40 per cent of the Indian population speaks Hindi or one of its variants, language tools are the only way sales can break the present saturation point.”

October 25, 2006 Posted by localization | DIT_MIT | | No Comments Yet

As per PRESIDENTIAL ORDER of 1960, Language of Acts, Bills, etc. -

As per PRESIDENTIAL ORDER of 1960,

11.Language of Acts, Bills, etc. -

(a) The Committee has expressed the opinion that Parliamentary legislation may continue to be in English but an authorised translation should be provided in Hindi. The Ministry of Law may, in due course, initiate necessary legislation to provide for an authorised Hindi translation of Parliamentary legislation which may continue to be in English. Arrangements may be made by the Ministry of Law also for providing translations of Parliamentary legislation into the regional languages.

(b) The Committee has expressed the opinion that where the original text of Bills introduced in or Acts passed by the State legislature is in language other than Hindi, a Hindi translation may be published with it besides an English translation as provided in clause (3) of article 348.

In due course, legislation may be initiated for the publication of a Hindi translation of State Bills, Acts, and other statutory instruments, along with the text in the official language of the State.

The following Government Resolution, as adopted by both Houses of Parliament, is hereby published for general information:-

RESOLUTION

“WHEREAS under article 343 of the Constitution, Hindi shall be the official language of the Union, and under article 351 thereof it is the duty of the Union to promote the spread of the Hindi Language and to develop it so that it may serve as a medium of __expression for all the elements of the composite culture of India;

This House resolves that a more intensive and comprehensive programme shall be prepared and implemented by the Government of India for accelerating the spread and development of Hindi and its progressive use for the various official purposes of the Union and an annual assessment report giving details of the measures taken and the progress achieved shall be laid on the Table of both Houses of Parliament and sent to all State Governments;

 

2. WHEREAS the Eighth Schedule of the Constitution specifies 14 major languages of India besides Hindi, and it is necessary in the interest of the educational and cultural advancement of the country that concerted measures should be taken for the full development of these languages;

The House resolves that a programme shall be prepared and implemented by the Government of India, in collaboration with the State Governments for the coordinated development of all these languages, alongside Hindi so that they grow rapidly in richness and become effective means of communicating modern knowledge;

3. WHEREAS it is necessary for promoting the sense of unity and facilitating communication between people in different parts of the country that effective steps should be taken for implementing fully in all States the three-language formula evolved by the Government of India in consultation with the State Government;

This House resolves that arrangements should be made in accordance with that formula for the study of a modern Indian language, preferably one of the Southern languages, apart from Hindi and English in the Hindi speaking areas and of Hindi along with the regional languages and English in the non-Hindi speaking areas;

4. AND WHEREAS it is necessary to ensure that the just claims and interest of people belonging to different parts of the country in regard to the public services of the Union are fully safeguarded:

This House resolves –

(a) that compulsory knowledge of either Hindi or English shall be required at the stage of selection of candidates for recruitment to the Union services or posts except in respect of any special services or posts for which a high standard of knowledge of English alone or Hindi alone, or both, as the case may be, is considered essential for the satisfactory performance of the duties of any such service or post; and

 

(b) that all the languages included in the Eighth Schedule to the Constitution and English shall be permitted as alternative media for the All India and higher Central Services examinations after ascertaining the views of the Union Public Service Commission on the future scheme of the examinations, the procedural aspects and the timing.

October 25, 2006 Posted by localization | Govt. of INDIA | | No Comments Yet

The Linux Localisation Initiative

The Linux Localisation Initiative

http://lli.linux-bangalore.org/articles/articlesr.php?rel=1

linux-bangalore.org Author : Frank Pohlmann

October 23, 2006 Posted by localization | LINUX | | No Comments Yet

C-DAC launches Janabhaaratii FLOSS localisation project

C-DAC launches Janabhaaratii FLOSS localisation project

http://www.linux-india.org/index.pl?id=3055&isa=NewsArticle&op=show

C-DAC, the Indian government-run Centre for the Development of Advanced Computing, has announced in October 2004 the launch of the launch of its “janabhaaratii” project for localisation of FLOSS. It suggested the goal would be “development, deployment and community building, and the work would be put out under the GPL/LGPL licenses.

This, it said, would aim “to contribute to the community efforts in developing a software suite based on GNU/Linux to be made available in Indian languages”. This project is being funded by the Government of India’s Technology Development in Indian Languages (TDIL) group of the Department of Information Technology,

More details from the official press release are http://www.cdacindia.com/html/press/4q04/prs_rl158.asp

“The primary objective of the project is to enable the wide spread use of Indian language computing through Free/Open Source systems and applications localized in Indian languages,” the C-DAC announced.

C-DAC is known for a number of other Indian-language computing solutions, but its involvement in the field of Linux and FLOSS is seen to be not as much as it could have done.

Some of the C-DAC’s earlier Indian-language solutions include ALP-Personal (the Apex Language Processor, an Indian language word-processor on DOS), GIST-SDK (a Windows-based application development kit for Indian languages), iplugin (an Indian-language web application development tool for interactive applications which promises browser-independence for MS Windows-based clients), and others.

Its ISM Import is a utility that converts data from other font formats into ISFOC format. LEAP Mail offers e-mail in Indian languages. LISM is a Linux-based application for Indian languages. iLEAP is an Internet-ready word processor, which works on the MS Windows platform. The INSRIPT keyboard tutor is a DOS-based utility meant for learning to use one of the keyboard layouts used to type Indic languages.

C-DAC said its new venture “aims at inviting, building and supporting” community initiatives to “produce and disseminate free/open source software systems, applications and content to help proliferate the use of IT in our society, breaking language barriers and through affordable means to bridge the so-called digital divide.”

This project, it said, would cover the development of technology to suit Indian languages, and target diverse areas such as the home-use, mass applications, education, rural areas, infokiosks, cybercafes, e-governance, and content creation.

“The project inherits and will enhance the access and usability of the resources and capabilities garnered by C-DAC in Indian language technologies over the last 15 years,” it was officially stated.

C-DAC, Mumbai — one of the centres of this institution — sees itself as having been “at the forefront” of the Indian language technology mission with its INDIX project for making the GNU/Linux operating systems specifically intelligent about indic scripts and “provide the world with a more generic approach to deal with these complex scripts”.

(C-DAC reminded that many of the fonts developed under the INDIX project are now also available to the public from the TDIL website-http://www.tdil.mit.gov.in/download/openfonts.htm )

Acknowledging the work of others — including determined volunteers — C-DAC said the project “stands on the shoulders of several teams working on the localization of GNU/Linux, namely, the IITs, IIITs, Indlinux, ankurbangla, HBCSE (TIFR), FSF India, and corporations like IBM and so on”.

These groups have put in strenuous efforts to work towards the localisation of computing in India, sometimes with amazing results.

“Several resource centres supported by the TDIL have also stacked up a large amount of localization resources. The project envisages the threading together of these efforts, to present to the community a suite of software components in ready usable form through Open Source under GPL/LGPL licences,” said the C-DAC.

Dr. Alka Irani of C-DAC Mumbai has been named the chief investigator for the project. Prof. Jitendra Shah from Veermata Jijabai Technical Institute (VJTI) — a long time localisation campaigner — will also be collaborating on the project, and has joined C-DAC, Mumbai, it was announced.

Said the C-DAC in its official release: “The nature and the scope of the project being such, C-DAC also extends an invitation to language specialists, linguists, computer specialists, users, governments (Centre and States), academia (faculty and students) and others to join this mission and bring the benefits of the IT revolution to the majority of the population.” (ENDS)

See story at http://www.cdacindia.com/html/press/4q04/prs_rl158.asp

October 23, 2006 Posted by localization | LINUX | | No Comments Yet

C-DAC launches ‘janabhaaratii’© Project Localization of Free/Open Source Software

C-DAC launches ‘janabhaaratii’© Project Localization of Free/Open Source Software

http://www.cdac.in/html/press/4q04/prs_rl158.asp

Mumbai, October 11, 2004

Centre for Development of Advanced Computing (C-DAC) has announced the launch of its “janabhaaratii” project entitled “Localization of Free/ Open Source Software: Development, Deployment and Community Building” to contribute to the community efforts in developing a software suite based on GNU/Linux to be made available in Indian languages. The project is being funded by the Technology Development in Indian Languages (TDIL) group of the Department of Information Technology, Ministry of Communications and IT, Government of India.

The primary objective of the project is to enable the wide spread use of Indian language computing through Free/Open Source systems and applications localized in Indian languages. It aims at inviting, building and supporting community initiatives to produce and disseminate free/ open source software systems, applications and content to help proliferate the use of IT in our society, breaking language barriers and through affordable means to bridge the so-called digital divide. The project will cover the development of technology to suit Indian languages, its dissemination and deployment into diverse areas such as home use, mass applications, education, rural areas, infokiosks, cybercafes, e-governance, content creation and so on.

The project inherits and will enhance the access and usability of the resources and capabilities garnered by C-DAC in Indian language technologies over the last 15 years. C-DAC, Mumbai has been at the forefront of the Indian language technology mission with its INDIX project for making GNU/Linux operating systems specifically intelligent about indic scripts and provide the world with a more generic approach to deal with these complex scripts. (Many of the fonts developed under the INDIX project are now also available to the public from the TDIL website-http://www.tdil.mit.gov.in/download/openfonts.htm)

Further, the project stands on the shoulders of several teams working on the localization of GNU/Linux, namely, the IITs, IIITs, Indlinux, ankurbangla, HBCSE (TIFR), FSF India, and Corporations like IBM and so on. Several Resource Centres supported by the TDIL have also stacked up a large amount of localization resources. The project envisages the threading together of these efforts, to present to the community a suite of software components in ready usable form through Open Source under GPL/LGPL licences.

Dr. Alka Irani, C-DAC, Mumbai is the chief investigator for the project. Prof. Jitendra Shah from Veermata Jijabai Technical Institute (VJTI) will also be collaborating on the project, and has joined C-DAC, Mumbai.

The nature and the scope of the project being such, C-DAC also extends an invitation to language specialists, linguists, computer specialists, users, governments (Centre and States), academia (faculty and students) and others to join this mission and bring the benefits of the IT revolution to the majority of the population.

Click here to send an eMail

October 23, 2006 Posted by localization | CDAC | | No Comments Yet

Text-to-Speech and Automatic Speech Recognition in Indian Languages (Matrubhasha)

Text-to-Speech and Automatic Speech Recognition in Indian Languages (Matrubhasha)

http://www.ictrt.org.in/modules.php?name=Pages&pa=showpage&id=12
Introduction

In the present era of human computer interaction, the educationally under privileged and the rural communities of India are being deprived of technologies that pervade the growing interconnected web of computers and communications. One good solution for this problem would be computers talking to the common man in the language he is comfortable to communicate in. Indian population has a significant percentage of people who are educationally under-privileged. There are still quite a large number of areas where people do not have the capabilities of 3R’s. The digital divide under such circumstances is constantly on a rise, where on one hand we claim that India is leading in IT and on the other hand, the advances we make are totally inaccessible by a large number of countrymen. Under such circumstances, we cannot expect rural/educationally under-privileged countrymen to use computers and IT products unless we remove the need of being literate, which exists as a barrier between them and computers.

Major Issues

In this information age, storage and retrieval of information in a convenient manner has gained importance. Because of the near-universal adoption of World Wide Web as a repository of information for unconstrained and wide dissemination, information is now broadly available on the Internet and is accessible from remote sites. However, the interaction between the computer and the user is largely through keyboard and screen-oriented systems. In the current Indian context, this restricts the usage to a miniscule fraction of the population, who are both computer-literate and conversant with written English. In order to enable a wider proportion of population to benefit from Information technology, there is a dire need for an interface other than keyboard and screen-interface that is widely in use at present. Speech, being a natural means of communication among human beings, can also provide a consummate platform for man-machine interaction. It is also desirable that human-machine interface permits one’s native language of communication. In the context of a multi-lingual country like India, this can be of immense value to our country where literacy rate is considerably low. Certain efforts are currently been undertaken to develop OS and applications, which support the local languages. Localization efforts have been undertaken by most of the leading OS vendors and promoters, which include Microsoft (Windows), Red Hat (Linux), NCST (Indix), IIT Madras (IndLinux) etc. These OS’s support some of the leading Indian languages by using international coding standards (Unicode). Speech technologies promise to be the next generation user interface. Software application having speech and voice recognition abilities have a better chance to communicate with a large percentage of population which include educationally under-privileged, visually challenged and computer illiterates, if these applications can speak and understand the native language. Hence we put forward the API (Application Programming Interface) Model based on Unicode for Text to Speech Synthesis and Automatic Speech Recognition in Indian languages.

Why Unicode?

Unicode is the international standard that encodes characters in 16 bits as opposed to the ASCII standard encoding 8 bits. The Unicode Standard is the universal character-encoding standard used for representation of text for computer processing. Unicode provides a consistent way of encoding multilingual plain text. The design of Unicode is based on the simplicity and consistency of ASCII but goes far beyond ASCII’s limited ability to encode only the Latin alphabet. The Unicode Standard provides the capacity to encode all of the characters used for the written languages of the world. To keep character coding simple and efficient, the Unicode standard assigns each character a unique numeric value and name. It has laid out provisions for encoding all scripts in the world, and has been agreed upon by all major software providers, as well as international governments, as the most suitable character representation for all major character sets. The advantage of using Unicode as character set will enable the TTS engine to recognize different Indian languages in the same document or character string, where as using a separate encoding format for each of the languages would not support the same. All the content development applications such as Office Suite’s, Star-Office, and Open-Office etc. are already being localized using Unicode. Unicode is already playing a significant part with respect to localization and internationalisation. Since it handles the characters for all languages in a uniform way, it avoids the complexities of different character code architectures. All of the modern operating systems, from PCs to mainframes, support Unicode now or are actively developing support for it. The same is true of databases, as well. In this scenario, Unicode would be the best option to go with, in the context of the current problem being discussed about!

Why API Model?

While we talk about IT being completely in-reach of the common countrymen, it is not just enough to have an operating system, or one specific application that supports local language speech synthesis or speech recognition. The point is that in the current juncture where in the growth and impact of technology is day by day rising exponentially; any software application is a candidate for localization. Hence, we put forward this API model so that any software developer can incorporate speech capabilities into one’s application, thus extending the reach of the product even to the masses.

Mr. Raman and the Matrubhasha Team

October 23, 2006 Posted by localization | Govt. of INDIA | | No Comments Yet

Enable Indian language typing support to the existing e-mail portal of Satyam Infoway (SIFY)

Enable Indian language typing support to the existing e-mail portal of Satyam Infoway (SIFY)

http://www.cdac.in/html/about/success/sify-mail-gist.asp

Problem Statement

Enable users to send & receive e-mails in regional languages viz.
Assamese, Bengali, Gujarati, Hindi, Kannada, Oriya, Punjabi, Sanskrit, Tamil, and Telugu through the existing e-mail facility of mail.sify.com.

Solution:

A customized iPlugin was developed and delivered as per SIFYs requirement, which enables users to type their e-mail in 11 regional languages. The regional languages supported are Assamese, Bengali, Gujarati, Marathi, Hindi, Kannada, Oriya, Punjabi, Sanskrit, Tamil, and Telugu.

When a client logs into mail.sify.com with his e-mail id for the first time, he / she has to select the language in which he / she wants to type the e-mail. Based on the language selected the user will be automatically prompted to download the required language components. This is a one-time process for that particular language. The next time onwards the user can simply log in, select language and start typing.

The components that are downloaded one time at the client end (e.g. for Hindi, Marathi) are
DV_GistFontResourcesforWeb Control: 100KB
iPlugin Control : 164KB

A floating keyboard in the respective language also assists in composing the e-mail. When the e-mail is sent to the respective destination, if the fonts already exist at the recipients end, then the data is properly displayed. However, in case the required fonts are not available then the recipient has to download and execute the .exe attached with the mail which copies the requisite fonts to the fonts folder of the recipients machine. This is again a one-time activity.

Acknowledging the quality of the product developed and the capable services rendered by the GIST group of C-DAC, Pune, Mr K Bhaskar, DGM-Technical, Sify Limited says It has been a pleasure in choosing this product and working with you in understanding and implementing this solution. We have now provided language mail for our users in mail.sify.com

System: Linux as server and PHP as scripting language. The client side computer on which the typing is done, uses Windows OS and IE as the browser.

 

 

October 23, 2006 Posted by localization | CDAC | | 1 Comment

eGovernance through Regional Languages:: 5% Speak English::95% Speak 18 different languages

eGovernance through Regional Languages

http://www.cdac.in/html/egov/article.asp

Out of a billion population, there are only 5% people in India who know or speak English, with balance 95% (950 million) people speaking or practicing in at least 18 different officially recognized languages. This poses both a challenge and an opportunity.

With increasing recognition of information technology in catalyzing economic activity and efficient governance, countries have benefited through eGovernance. In India, application of Indian languages on computers has driven eGovernance initiatives. C-DAC has applied language technologies successfully to a number of eGovernance solutions to deliver efficient Government Services in a transparent manner.

Practically all world economies have recognized importance of information technology in catalyzing economic activity, in efficient governance and in human resource development. They have, therefore, made significant investments and successfully integrated it with the development process in reaping the benefits of the information technology revolution that is taking place globally, to their society. In India, likewise, these developments have impacted the Industrial, Education, Service and Government sectors, and their influence on various applications is increasingly being felt of late.

As the era of digital economy is evolving, a significant impact of these developments has been felt in eGovernance. The questions often asked in the context are:

 

  • How government can become more responsive and accessible?
  • How can the government enhance its role as a catalyst of economic growth?
  • How can one provide better Government services? and
  • How can the Government use advanced technologies for transferring benefits to the society in terms of improving health care, education, administrative and citizen interface services?

eGovernance consequently has emerged as a technologically driven methodology to realize economic prosperity leading to transparency, providing information speedily to all citizens, improving administrative efficiency, improving public services, higher velocity of business, improved productivity and an exciting business opportunity.

In a large, geographically dispersed, demographic multilingual country India, the common thread in implementing and achieving these basic objectives of eGovernance has been the development and adoption of language computing tools and methodologies. The Government officials in various provinces, the non-government functionaries across the country and the people, mostly use their own language in day-to-day work, be it in Government administration at various levels, in business, in profession, in services or in school education. Thus, if the fruits of information technology revolution have to spread to all these participating members, in Government and public, it is best done through the use of computers in their own languages.

Out of a billion population, there are only 5% people in India who know or speak English, with balance 95% (950 million) people speaking or practicing in at least 18 different officially recognized languages. This poses both a challenge and an opportunity.

The Centre for Development of Advanced Computing (C-DAC) has made pioneering contributions in developing Indian language tools with natural language processing, and in evolving script and font standards through its GIST technology, to enable and spread use of computers in various languages. It accordingly took up the initiative of developing important eGovernance solutions in Indian languages, which impact Government and the citizens both. This initiative started in 1997 and has grown to a significant extent by the end of 2001. Significant parameters of this initiative were:

 

  • Improve government’s own functioning
  • Provide better service to citizens in a transparent manner
  • Reduce hassles, corruption and drudgery in various government bodies – public interactive functions


Significant information technology based applications developed and successfully commissioned under this initiative are:

 

  • Public Works Department (Maharashtra State) – Covered Works, Accounts, Employees and Tender management modules networking the various 250 state offices to lead to an improved, transparent and efficient system of works services. Involved an outlay of Rs. 10.5 crores (U.S. $ 2.0 Million) to address Works Services of an average Rs. 2500 crore (US$ 500 Million) of the PWD – Maharashtra. This is now proposed for other States also.

  • Stamp Registration (Maharashtra and U.P. States) – Provided on-line property registration, valuation and report generation across 366 offices at various state administrative units, reducing time from several days to mere 20 minutes for an individual, and increasing number of registered documents from 16 to 40 per day with 10 – 15% revenue increase. Involved an outlay of Rs. 5.5 crores (US$ 1 Million) to address a Statewide annual revenue of Rs. 2000 crores (US$ 400 Million).

  • Municipal Corporations (Karnataka) – Computerized major functions of property tax valuation/collection, issue and record of death/birth certificates, water supply billing, consumer complaints and internal MIS functions providing improved citizen services. Involved an outlay of Rs. 2.5 crores (US$ 0.5 Million) in the first phase of six Corporations to address a budget of Rs. 2000 crores (US$ 400 Million).

  • Octroi collection (Nashik – Maharashtra) – Dispensed with cash collection at remote check posts, providing instant valuation, receipt and reconciliation of accounts in a transparent manner. Total outlay Rs. 60 lakhs (US$ 0.1 Million).

  • Decision Support System (Andhra Pradesh State) – Implemented a data warehouse of land and person data of 60 million population to enable well informed, timely and accurate policy decisions by the Government officials across various departments. Involved an outlay of Rs. 5 crores (US$ 1 Million) to address the total State data.

  • Hospital Management System (Specialty and Government Hospitals) - Implemented to improve Healthcare services for the patients. Involved an outlay of Rs. 1.5 crore (US$ 0.3 Million) over two hospitals of 500 beds and 1500 beds respectively.

  • GIS based Land Management (Industrial Development Corporations) - Implemented at a cost of Rs. 55 lakhs (US$ 0.11 Million) to provide web-based access to land data covering allotment, transfer, mortgage, surrender, etc. of industrial development units, in the particular case for MIDC, Maharashtra.

  • Archives Computerization - This application enables effective scanning and archival of various types of old documents with proper enhancement, indexing and retrieval facilities. This application has been successfully deployed for Department of Archives, Govt. of Maharashtra.

All these applications have been very effective in the achievement of their objective, have involved significant technology component covering web, data warehousing, database architecture, Geomatics, Scan/Archive, advanced software tools, and applied in a most innovative manner, ensured replicability to other organizations by building a licensable software product. These also enabled, through business process re-engineering, an effective organizational change to ease their functions and offer a more productive service to the intended beneficiaries. Their relevance has been equally brought about by providing language technology component of significance to the target user group(s), which have broken any language barriers by enabling users to interact with computers in their own languages.

C-DAC groups were organized activity-wise to implement various development projects that were commissioned during the period 1997-2001. The groups comprised of approximately 100 software professionals who worked over an approximate budget of Rs. 30 crores (US$ 6 million) to address the direct business of over Rs. 6500 crores (US$ 1.3 Billion) of the State Governments annually. This initiate is now being spread to other parts of the country to similarly provide effective solutions and services.

C-DAC is an institution of a 575 regular plus 175 contract staff of high skills in advanced information technologies, training and business activities. It is an autonomous scientific institution of the Ministry of Communications and Information Technology, Govt. of India. It currently operates on an annual budget of Rs. 100 crores (US$ 20 million) and works with a network of 100 plus partner institutions in the private enterprise for providing training and support services countrywide.

Set up over a decade ago, as India’s national initiative for design, development and delivery of high performance computing (supercomputer systems) and solutions based on parallel processing technology, C-DAC has over the years diversified its activities, transferring the expertise it acquired and technologies it developed in the high-end computing to develop and deploy Information Technology (IT) based solutions in various sectors of economy. Through this approach, it has maintained a balance between developing strategic technologies needed in the country in the high performance computing area for achieving self-reliance, and addressing mission critical problems in the science and engineering fields on one hand, and using expertise developed to commercialize its technologies and products to meet the requirements in various sectors of economy on the other.

October 23, 2006 Posted by localization | CDAC | | No Comments Yet

ICT Research and Training Centre, India.

Welcome to the ICT Research and Training Centre, India.

 http://www.ictrt.org.in/

The Government of India is a member of the
Development Gateway Foundation, a World Bank
initiative.

An Information and Communication Technologies (ICT) – Research & Training (R&T) Centre of the Foundation has been set up in Bangalore, India with Centre for Development of Advanced Computing (C-DAC) as the Project Implementing Agency and the Indian Institute of Technology (IIT), Bombay as the first collaborating institution.

 

 Language Technology
The dependency on English and other foreign langauges has become main stream and it influences all avenues of our day-to-day life. The aim of this is to develop Indian language technologies inorder to empower the ‘digitally impoverished’.

Projects:

BharateeyaOO

This project aims at enabling the support of Indian Languages within the OpenOffice.org office suite on the main software platforms, so as to facilitate the digital representation, collection and distribution of information by ensuring access to information technology seamless over natural language barriers. Indian language support in the suite will be through localization of the complete user interface and help, in Hindi, Tamil, Kannada, Punjabi, Gujarati, Telugu, Bengali and Malayalam. Internationalization support will also be developed featuring editing and complex text layout processing in Indian scripts, Indian language currency, calendar and spell checking.

Cross Lingual Information Retrieval

The web is a critical and vast source of information in today’s world. However most of the information is in English, which is understood by less than 5% of the Indian population. Search engines are the primary mechanisms to find information on the web. However most of the search engines do not allow querying in Indian languages. Even though a few offer screen layout and static text in Hindi, the querying and results retrieved are still in English. So a person literate in Indian languages but not well versed in English is deprived of access to a vast store of information. To bridge this Digital Language Divide, one of the key technologies required is Cross Lingual Information Retrieval (CLIR). The proposed CLIR system aims at enabling a person to query the web for documents related to health issues and obtain the results, in Hindi.

Other Initiatives: News And Events:
IndiX DGF Article – BharteeyaOO
LEAP OFFICE 2000 GNU Linux enabled for five Indian Languages
MaTra
SHAKTI OFFICE SUITE
Shree-Lipi Ankur Script Processor
Lila Hindi Pragya
Learn Indian Language Through Artificial Intelligence (LILA)
Speech Synthesis System

 

October 23, 2006 Posted by localization | Govt. of INDIA | | No Comments Yet